Preservice teachers' perceptions of reading instruction

"Teachers teach as they are taught and not as they have been instructed to
teach" is a familiar hypothesis expressed by disheartened teacher educators.

the gathering of data over a fourteen-or-more-year period, would tend to
[...] of observed differences noted by the [...]

could adjust instruction accordingly. 



Method 



At the start of the 1975 Spring semester, 110 preservice teachers enrolled
[...] of reading instruction.



Materials and Procedure

The PTPQ, designed to determine preservice teachers' perceptions of reading
practices, consisted of an 83-item self-report questionnaire. The items were
designed by the authors and were partially based on the differences noted
between suggested practices and actual performances observed of prior
students in the practicum sessions of the course, "Diagnostic Teaching of
Reading."

The format of the questionnaire called for two written responses per item.
After reading the item, the students were directed first to mark a space denoting
the frequency with which an instructional reading practice was judged to have
taken place. Always, sometimes, seldom, and never were the response choices
provided. In addition, the subjects were given the opportunity to mark outside
of the frequency response box if they were unable to recall the instructional
practice. This was labeled an "I forget" response. The second written response
called for the subjects to judge the explicitness of the item. Clear and unclear
choices were provided for this response.

In addition to the oral directions given to subjects for marking the items of the
questionnaire, an overall set for responding to the instrument was provided. The
preservice teachers were instructed to try to remember their experiences
when receiving reading instruction and to make their responses based on these
memories.

Three forms of the questionnaire were administered simultaneously to each of
the seven sections of preservice teachers. This was managed by having
approximately a third of the subjects in each section respond to one of three
forms. The three forms resulted from reordering of the 83 items.

Responses obtained from administering the PTPQ were analyzed in order to
check the soundness of the instrument as a diagnostic device, as well as for
interpretive purposes.

The estimates of soundness were based upon (1) the explicitness of the items,
(2) the consistency of the instrument, (3) the logical relationships existing
between similar items, and (4) the variability of the responses. Interpretation of
the data resulted from classifying related teaching practices into categories and
then rank-ordering each practice according to the relative frequency indicated
by the preservice teacher responses. In addition, patterns were interpreted by
clustering conceptually-related and frequently-related items. Phi coefficient,
chi², means, and standard deviation statistics were computed for each item.
In determining the phi coefficient and chi² statistic, frequency counts of the
response categories (always, sometimes, seldom, and never) were combined to
represent frequent and infrequent practices. The response categories "always"
and "sometimes" formed the frequent practices, whereas the "seldom" and
"never" responses formed the infrequent practices. The "I forget" response was
not considered in the computation. A five-point scale was used for determining
the mean and standard deviation for each item. All response categories were
included in the scale, with the "always" response considered as a five, ranging
to the "I forget" response as a one.
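The scoring scheme just described can be illustrated with a minimal sketch. It assumes the intermediate scale values (sometimes = 4, seldom = 3) fall evenly between the stated endpoints, and it uses the sample standard deviation; the source does not say which form of the standard deviation was computed. The function name `summarize_item` is hypothetical and is not part of the original analysis.

```python
from statistics import mean, stdev

# Five-point scale: "always" = 5 ranging down to "I forget" = 1.
# The values for "sometimes" and "seldom" are assumed to be evenly spaced.
SCALE = {"always": 5, "sometimes": 4, "seldom": 3, "never": 2, "i forget": 1}

# Collapsed categories used for the phi and chi-square computations.
FREQUENT = {"always", "sometimes"}
INFREQUENT = {"seldom", "never"}


def summarize_item(responses):
    """Return mean, SD, and collapsed frequency counts for one item."""
    cleaned = [r.strip().lower() for r in responses]
    scores = [SCALE[r] for r in cleaned]
    return {
        "mean": mean(scores),
        "sd": stdev(scores),  # sample SD; an assumption, see lead-in
        # "I forget" responses are scored on the scale but excluded here.
        "frequent": sum(r in FREQUENT for r in cleaned),
        "infrequent": sum(r in INFREQUENT for r in cleaned),
    }
```

Note that the "I forget" response contributes to the mean and standard deviation but not to the frequent and infrequent counts, which mirrors the two separate treatments described in the text.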

Results

The indicators of soundness of the PTPQ were found to be generally
supportive. All 83 items were judged to be explicit on the basis that less than
5% of the total responses indicated that any one item was unclear. However,
the unclear responses did differentiate subjects. Seventeen of the 110 subjects
accounted for more than 95% of the unclear responses, with one preservice
teacher marking 32 of the 83 items "unclear."

Based upon an item-by-item visual inspection of three frequency distributions
resulting from the administration of the three forms, the PTPQ appears to be
consistent. Frequency distributions for only three of the 83 items appeared to
[...] differences. However, when the adjacent response categories
were combined into frequent and infrequent, the response
distribution of each of the three forms was within a five (5) percentage point
range of the total frequency distribution for the item.

It was reasoned by the writers that (1) conceptually related teaching practices
would be perceived by preservice teachers as occurring with similar frequency
and (2) that conceptually opposite teaching practices would be perceived as
[...]. Data supported the first proposition in that
several clusters of similar teaching practices were perceived by the subjects
[...]
appears questionable. Whereas reciprocal statement pairs such as items 5 and 6
indicate a logically consistent relationship, items 17 and 18 are inconsistent.

Table 1

Perceptions of Teaching Practices Related to Oral Reading

Rank  Item(a)  Phi(b)  Chi²(c)   SD    M      Concepts
 1    29        .563   (.01)     .5   4.65   Read with expression
 2    31        .506   (.01)     .9   4.38   Read accurately
 3    28        .483   (.01)     .8   4.46   Read carefully
 4    30        .470   (.01)     .8   4.34   Pronounce distinctly
 5    17        .412   (.01)     .8   4.17   Called on students systematically
 6     6        .348   (.01)     .9   3.95   Without prior preparation
 7    45        .280   (.01)    1.0   3.56   Reread relevant segments
 8     8        .235   (.01)     .8   3.61   Peer prompting
 9    18        .166   (.05)     .8   3.65   Called on students randomly
10    24        .147   (.05)    1.2   3.35   Corrected for guessing
11    27        .125            1.3   3.35   Corrected for substitution-sound similarity
12    25        .098            1.1   3.35   Corrected for failure to attempt a word
13    19        .052             .9   3.39   Called on volunteers
14    44        .049            1.1   3.21   Reread entire story
15    26        .0?7            1.2   3.21   Corrected for semantic substitution
16    20        .005            1.0   3.29   Selected best reader for majority of reading
17    46       -.0?6            1.1   2.96   Clear rationale for rereading
18    47       -.128            1.1   2.?6   Reread parts of story with difficult words
19     7       -.146   (.05)    1.0   2.94   If error, peer assigned to continue
20     5       -.191   (.01)    1.0   2.95   Prepare for oral reading

(b) [...] occurring [...]
(c) Figures in parentheses indicate level of significance reached by applying the chi² statistic.


It was further assumed by the authors that, if the PTPQ was to be of diagnostic
use, it should contain items that addressed perceived teaching practices that
range from very frequent to those that rarely occur. Keeping in mind that a 5-point
scale was used in which the low frequency responses were valued at two for
never and one for "I forget," there were eleven items with means of 4.00 and
greater, 48 items with means between 3.00 and 3.99, and 24 items with means
between 2.00 and 2.99. Variability of the items ranged between standard
deviations of .5 to 1.5, with standard deviations for fifty-five of the 83 items
being 1.0 and greater. Thus the instrument was judged to be effective in
reflecting the individual differences of adult students.

In addition to analyzing the results for soundness purposes, the authors
examined the data for indicators of the types of perceptions preservice teachers
held respective to reading instructional practices. The 83 items of the PTPQ
were classified into six categories: (1) oral reading, (2) word recognition,
(3) comprehension, (4) classroom management, (5) general instructional practices,
and (6) teaching practices related to ideas about reading. Following the
classification process, the items in each category were rank ordered by frequency.
This was done on the assumption that those behaviors perceived to be the
more frequent would have greater impact on preservice teachers as a group than
those teacher behaviors that were perceived to occur less frequently. Isolated
teacher practices of significance occurred in each of the six categories; however,
the results reported here will be limited to those trends interpretable from clusters
of perceived teaching practices.

Seven general trends were suggested by the data. The first trend was formed
from nine items that related to the teacher communicating expectations or
criteria for student performance. This trend was perceived by preservice
teachers to be the most frequently occurring of those determined by the
instrument. Items 28, 29, 30, and 31 in Table 1 exemplify the perceived teaching
behaviors that comprised this trend.

A second trend noted from the responses concerned the management of
students' performances. The 22 teaching behaviors comprising this trend
included determining of specific reading activities, grouping practices, and
procedures for eliciting responses. While some of these practices vary in
frequency from very high to very low, the majority of the practices were
perceived by preservice teachers to be of frequent occurrence. The third trend
was also perceived to be of frequent occurrence. This trend was formulated by
teaching practices related to correcting inappropriate student responses. Items in
Table 1 such as 8, 24, 25, and 26 exemplify the types of teaching behaviors
that comprised this trend. In general, the clusters of teaching behaviors that
were perceived to occur frequently (trends one, two, and three) are at least
tangentially related to the practical, how-to-do-it aspects of reading instruction.

The fourth, fifth, and sixth trends interpretable from the data were perceived
by preservice teachers to occur infrequently. The fourth trend was formed from
seven teaching practices related to preparing students for subsequent reading
activities. The cluster includes items such as 6, 46, and 5 in Table 1. Trend
four is logically related to trends one and three. An example of this relationship
can be illustrated by the concepts reflected by items 5, 29, and 24 listed in
Table 1. For example, if students were provided with opportunities to prepare
for oral reading, then it seems likely that they would read expressively and,
in addition, the teacher would not need to make frequent corrections for guessing.
However, the data suggest that this relationship did not hold in that teachers
were remembered as seldom preparing students regarding the rationale for a
task.

Trends five and six are more tentative in that there were only two or three
[...] in each cluster. Most preservice teachers perceived their reading
[...] instruction (trend five) and using [...]
[...]
are related to the reasons for, or the why of, reading instruction.



Conclusions



The results respective to both the characteristics of the instrument and the
information resulting from its application have been
[...]
in order to test this interpolation.
An effective inservice model for
content area reading in the secondary schools

Staff development programs for teachers may take many forms, as have been
described in recent literature. Often typical inservice programs, described as
"various kinds of one-shot 'dog and pony shows'" (Dillon, 1976, p. 169),
involve a visiting consultant who is invited to the school for an hour or two to
speak on his/her area of expertise. Teachers listen dutifully; some even take
notes. But no serious change in their practices and attitudes is either expected or
achieved. The problem with this "haphazard" approach to staff development is
that "it is disorganized, piecemeal, patchwork" (Edelfelt, 1977, p. 112). This
paper is a report of the development of a model for reading inservice education
in the secondary schools that was designed to have an impact.

The Content Area Reading Project, which was funded by the Pennsylvania
Department of Education from January 1, 1976 to June 30, 1977, involved an
inservice effort for junior high school and adult basic education/high school
equivalency (ABE/GED) teachers. The Project's goal was, first, to change
attitudes positively toward teaching reading as part of content area subjects and,
second, to effect change in that direction in classroom practices. Although all
cooperating schools had identified content area reading as a weakness among
students primarily due to inadequate teaching strategies, we realized that, as
outsiders or "visiting experts" from a university, we might, in fact, have very
little impact on teachers' attitudes and classroom practices (Otto & Erickson,
1973). Therefore, we chose a model for inservice education which might be
used more often by a consultant regularly employed by a school district
(e.g., curriculum director, reading consultant) in which a series of inservice
sessions could be scheduled over a long period of time with follow-up in
the classrooms.

The school administrators, particularly at the junior high school level, were well
aware of the problems of integrating reading with content subjects. Content
teachers who may be well versed in their subject areas frequently have
difficulty individualizing instruction for different reading abilities due to large,
changing content classes (Burnett & Schnell, 1975). Olivero (1976) also suggests
that "probably less than one percent of the secondary teachers have ever been
taught the skills" (p. 195) of diagnosis necessary for individualization of
instruction. They may simply not have had any course work in reading methods
during their own preservice teacher training experience (Morrison & Austin, 1977)
and may not be aware of how to help students read content materials. Or
they may resist teaching reading skills "by rationalizing that 'reading isn't
my subject'" (Axelrod, 1975, p. 82).

Therefore, as Olivero (1976) suggests, changing attitudes was considered of
primary importance. Since affective objectives may be attained primarily through
activities with high experience impact and two-way communication (Otto &
Erickson, 1973), follow-up between inservice sessions by graduate assistants
serving as reading consultants seemed essential to observe and provide
feedback on progress toward integrating reading skills in the content subjects.

A competency-based format was selected since differences among the
teachers were anticipated in entry levels and progress in both attitudes and
skills. A computer record-keeping system was also employed to keep track
of teachers' progress in meeting workshop objectives or competencies.
At each session teachers received a computer printout showing objectives
mastered and unmastered. We hoped that modeling a diagnostic-prescriptive
approach in the workshops would influence teachers toward diagnostic-
prescriptive teaching. Objectives, which are presented elsewhere (Dupuis &
Askov, 1977), were written at three levels: cognitive, simulation, and
application. It was anticipated that the skills learned in the workshops would be
applied and observed in the classroom where the teacher integrates the skills
previously mastered at the cognitive and simulation levels.
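A record-keeping printout of the kind described above might look something like the following minimal sketch. It is not the Project's actual system, whose design is not described here; the names `TeacherRecord` and `progress_report` and the objective identifiers used in any call are hypothetical. Only the three objective levels come from the text.

```python
from dataclasses import dataclass, field

# The three levels at which workshop objectives were written.
LEVELS = ("cognitive", "simulation", "application")


@dataclass
class TeacherRecord:
    """Hypothetical record of the objectives one teacher has mastered."""
    name: str
    mastered: set = field(default_factory=set)


def progress_report(record, objectives):
    """Build a plain-text printout of mastered and unmastered objectives.

    `objectives` maps an objective identifier to its level.
    """
    lines = [f"Progress report for {record.name}"]
    for status, want_mastered in (("Mastered", True), ("Unmastered", False)):
        lines.append(f"{status}:")
        for level in LEVELS:
            ids = sorted(
                obj for obj, lvl in objectives.items()
                if lvl == level and (obj in record.mastered) == want_mastered
            )
            lines.append(f"  {level}: {', '.join(ids) if ids else 'none'}")
    return "\n".join(lines)
```

Grouping the printout by level reflects the expectation that application-level objectives are attempted after the corresponding cognitive and simulation objectives have been mastered.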




Method 



The target group for the study consisted of teachers of [...]
students at three sites. Since reading instruction for ABE/GED teachers has
been generally lacking (Hall & Coley, 1975), they were included with junior
high school teachers who teach potential or future members of ABE/GED
classes (for students who drop out of school). Reading specialists were also
included so that they might become more sensitive to the demands of the
content area classroom and provide help to content teachers as they deal with
students of all reading abilities.



Following recommendations made by others (Campbell, 1973; Parker &
Cam[...], 1973), we planned a year-long program, consisting of fifteen
bimonthly sessions, each lasting three hours, for volunteer teachers. Three or
six hours of university credit were available as an option for teachers who
[...] urban, suburban, and rural settings. The
workshops were held at the junior high school in each site. Teachers were
offered the option of released time during the school day with substitutes
hired by the project; however, they had elected to receive extra compensation
for attending the training sessions in the evening or after school.

A staff of four graduate education students was hired to serve as school
consultants. Each was assigned to teachers in a given school to provide
follow-up in the classrooms between workshop sessions. They not only attended
the workshops at their particular sites, assisting the two university instructors,
but also observed classes, held conferences with teachers, and did
demonstration teaching as requested. They served in the role of helper (Long, 1977,
p. 81) and facilitator of change.



Materials 



Model materials were created by project staff to demonstrate how teachers
might construct and use these materials in the classrooms. We
hoped, by modeling the use of multimedia materials in the workshops, to
encourage the use of these materials in the classrooms. For example, a
[...] social studies lesson was created which demonstrated
how a content area teacher could make his/her own videotape to be used in
classroom instruction. A listening center pertaining to note-taking was set up to
give teachers experience in listening comprehension and to encourage the
use of such centers in their classrooms.

An advisory board, consisting of professors at The Pennsylvania State
University from the various content areas which are typically taught in junior
high schools and adult programs, was established. Each advisory board
member identified sample materials from his/her content field that might be used
in junior high school and ABE/GED programs as well as professional books
relating reading skills to the content area subject. A professional library,
consisting of books suggested by advisory board members plus many other
resources dealing with content area reading, was developed for each site. The
professional library was provided so that teachers could complete assigned
readings without having to travel to a university library.

Evaluation 

An evaluation design was planned to determine the effectiveness of the
inservice education model. Following suggestions for evaluating staff
development programs (Bishop, 1977), various types of formative and summative
evaluation were planned in addition to assessing mastery of each workshop
objective. Since results could not be expected among junior high and ABE/GED
students immediately, only teacher variables were considered. So few teachers
of adults signed up for the project that their data were excluded from the
following analyses. Therefore, conclusions are presented only in terms of the
junior high school teachers.

Since attitude change was considered to be a prerequisite to changes in
classroom practices, attitudes toward incorporating reading skills in content area
subjects were assessed with two instruments, a twenty-item Likert scale
(Statements Survey, r = .84) and a less direct instrument in which teachers
responded to descriptions of twelve teaching situations using five sets of
bipolar adjectives on a semantic differential scale (Situations Survey, r = .90).
(See Dupuis & Askov, 1977, for information on the development of the
instruments.) Workshop teachers were pre and posttested with these two
attitude instruments and the Purdue Teacher Opinionaire (Bentley & Rempel,
1973). The Purdue Teacher Opinionaire (PTO), a measure of satisfaction with
the teaching situation, was administered to control for changes in attitudes
that might be due to school factors rather than to the Project. A comparison
group, teachers who worked in the same schools but were not part of the
workshops, was established in each junior high school; comparison group
teachers were administered the same pre and posttests.

A treatment x time interaction effect was found, F(1, 123) = 5.46, p < .05,
with experimental teachers' gains on the attitude measures significantly greater
than the gains of the comparison teachers. No significant time effect was found
for the morale measure (PTO); morale, although significantly lower at the urban
site, remained constant at all sites and did not appear to be a significant
factor in determining attitude toward incorporating reading instruction in content
area classrooms. The means are presented in Table 1.
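The direction of the interaction can be seen by computing simple pre-to-post gains from the all-sites means reported in Table 1. The sketch below is descriptive arithmetic only and does not reproduce the analysis of variance; the dictionary layout and the function name `gains` are illustrative assumptions.

```python
# Observed all-sites means transcribed from Table 1 as (pre, post).
MEANS = {
    "experimental": {
        "Statements Survey": (86.06, 90.78),
        "Situations Survey": (346.96, 374.90),
        "PTO": (284.38, 284.33),
    },
    "comparison": {
        "Statements Survey": (84.86, 84.10),
        "Situations Survey": (349.13, 360.25),
        "PTO": (293.50, 292.93),
    },
}


def gains(means):
    """Return post-minus-pre differences, rounded to two decimals."""
    return {
        group: {m: round(post - pre, 2) for m, (pre, post) in measures.items()}
        for group, measures in means.items()
    }


if __name__ == "__main__":
    for group, measures in gains(MEANS).items():
        for measure, gain in measures.items():
            print(f"{group:12s} {measure:18s} {gain:+.2f}")
```

On both attitude measures the experimental gain exceeds the comparison gain, while the PTO means are essentially unchanged in both groups, which is the pattern the text describes.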

In addition to assessing changes in teachers' attitudes, a criterion-referenced
knowledge of reading skills test, which measured teachers' mastery of the
cognitive aspects of the workshop objectives, was administered only to
workshop teachers at the beginning of the first workshop session and again at the
last workshop session. Mastery level was established at 80%. Of the 56
experimental group teachers who took the test at the first workshop session,
only three scored 80% or better. Of these same 56 teachers, significantly

Table 1

Pre and Posttest Observed Mean Scores on Attitude Measures (n in parentheses)

                                All Sites Combined          Urban Site Only             Suburban Site Only          Rural Site Only
                                Pre           Post          Pre           Post          Pre           Post          Pre           Post

Treatment Groups Combined:
  Statements Survey Scores      85.40 (129)   87.28 (129)   83.78 (41)    86.29 (41)    88.29 (49)    88.59 (49)    83.49 (39)    86.67 (39)
    (Likert scale)
  Situations Survey Scores      348.17 (129)  366.68 (130)  349.76 (42)   358.40 (42)   354.69 (48)   373.69 (49)   338.44 (39)   366.77 (39)
    (Semantic differential scale)
  PTO                           288.90 (127)  288.11 (127)  235.08 (40)   231.95 (40)   323.57 (49)   324.06 (49)   300.84 (38)   300.86 (38)

Experimental Groups Only:
  Statements Survey Scores      86.06 (58)    90.78 (58)    84.62 (21)    91.81 (21)    88.90 (20)    92.05 (20)    84.53 (17)    88.00 (17)
  Situations Survey Scores      346.96 (57)   374.90 (57)   347.43 (21)   368.95 (21)   351.68 (19)   377.00 (19)   341.12 (17)   379.88 (17)
  PTO                           284.38 (55)   284.33 (55)   245.21 (19)   239.68 (19)   315.00 (20)   317.70 (20)   292.63 (16)   295.63 (16)

Comparison Groups Only:
  Statements Survey Scores      84.86 (71)    84.10 (72)    82.90 (20)    79.57 (21)    87.86 (29)    86.21 (29)    82.68 (22)    85.64 (22)
  Situations Survey Scores      349.13 (72)   360.25 (72)   352.10 (21)   347.86 (21)   356.66 (29)   371.97 (29)   336.36 (22)   356.64 (22)
  PTO                           293.50 (70)   292.93 (70)   223.15 (19)   225.11 (19)   329.48 (29)   328.45 (29)   306.82 (22)   304.68 (22)

more teachers, 22, reached mastery level of 80% on the posttest given at
the final workshop session. In addition, in a site x time analysis of variance of
actual pre and posttest scores, significant gains, F(1, 53) = 196.02, p < .001,
were found. This would indicate that workshop participants significantly
increased their knowledge of basic methods for teaching reading in content
areas.

The graduate assistants assigned to work onsite with the teachers also
provided ratings of the experimental teachers at the beginning and end of the
school year. These ratings pertained to the degree to which the teachers were
judged to have incorporated reading skills in their classrooms. Mean exit ratings
were significantly higher, t(57) = 9.851, p < .001, across all sites (Dupuis &
Askov, 1977). Therefore, it appears that what was being learned in the
workshops was indeed being applied in the classrooms.

The data, as summarized, seem to indicate that the year-long inservice
program in content area reading significantly changed junior high school
teachers' attitudes toward and knowledge of reading skills. Furthermore,
the consultant ratings indicate that these changes were reflected in classroom
practices; teachers were observed to incorporate reading instruction into the
content study to a greater extent after the treatment period. Classroom
application was considered an important aspect of the Content Area Reading Project
since most inservice efforts stop short of classroom observations to make sure
that workshop learnings are being applied during instruction.

In addition to the summative evaluation data just reported, other kinds of
data were gathered in project evaluation. The issue of whether university credit
had an impact on mastery of workshop objectives was considered since
credit is often considered as a necessary incentive in inservice teacher education.
Of the total experimental sample, 60% elected to register for graduate credit
while participating in the workshop program: 86% of the teachers at the urban
site, 30% at the suburban site, and 65% at the rural site. Of the experimental
teachers registered for credit, 89% satisfactorily completed all objectives required
for graduate credit: 83% at the urban site, 100% at the suburban and rural
sites. Of those who did not register for credit, 13% satisfactorily completed
enough objectives to earn a certificate of completion (awarded to those who
completed the required workshop objectives).
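As a rough consistency check on the registration percentages, the sketch below converts the site percentages to head counts. It assumes the experimental group sizes at each site equal the Statements Survey n's in Table 1 (21 urban, 20 suburban, 17 rural); the text does not state the base used for the percentages, so these counts are illustrative only. The function name `registered_counts` is hypothetical.

```python
# Assumed experimental-group sizes, taken from the Statements Survey n's
# in Table 1; the base for the reported percentages is not stated.
SITE_N = {"urban": 21, "suburban": 20, "rural": 17}

# Proportion of experimental teachers registering for credit, by site.
CREDIT_RATE = {"urban": 0.86, "suburban": 0.30, "rural": 0.65}


def registered_counts(site_n, rates):
    """Return the nearest whole number of registrants at each site."""
    return {site: round(site_n[site] * rates[site]) for site in site_n}


if __name__ == "__main__":
    counts = registered_counts(SITE_N, CREDIT_RATE)
    total = sum(counts.values())
    overall = 100 * total / sum(SITE_N.values())
    print(counts, total, f"{overall:.0f}%")
```

Under these assumptions the site counts sum to 35 of 58 teachers, which rounds to the 60% overall figure reported above.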

However, when credit status (or credit x time) was examined in relation to
attitude scores, no significant effects were found. Whether or not a teacher
elected to participate in the project for credit or no credit was not significantly
related to attitude change during the experimental treatment period.

Furthermore, no significant effects were found for the credit status (or credit
x time) on the criterion-referenced knowledge of reading skills test. In other
words, change in the criterion-referenced test scores was not significantly
related to whether or not a teacher elected the graduate credit program
option offered as part of the experimental treatment.

Therefore, although credit status did affect the number of workshop objectives
completed, it did not seem to be related to changes in attitude toward and
knowledge of reading skills. Apparently, changes in attitudes and knowledge
were occurring from workshop attendance alone rather than from completing the
required work.

Evaluation data provided by the teachers in terms of the value and usability
of each workshop activity are reported elsewhere (Dupuis & Askov, 1977).
In general, teachers tended to favor the practical activities with direct
classroom application (e.g., vocabulary exercises) as opposed to those that
were more theoretical in nature (e.g., the concept of linguistic differences).

They were also extremely positive toward the follow-up help given between
[...] favorably to the competency-based
format of the inservice program; however, a period of [...]

One of the most perplexing problems was the poor response by ABE/GED
[...]
where (Schroeder & Haggerty, 1976).



Conclusion



[...] instead of in university classrooms. [...]
onsite between workshop sessions to help teachers apply what they [...]
in their own classrooms. Perhaps [...] objectives to [...]
teachers had not been forced by the structure of the [...]
[...] student achievement, which of course
is the [...]; however, are
[...] reading instruction in the content areas.
The effectiveness of informal assessment questions
constructed by secondary teachers



The documentation and research concerning informal reading inventories
[...] almost exclusively with elementary school use. Few studies have
been done on the use or effectiveness of these measures at the secondary
level. Despite this lack of documentation, numerous reading authorities have
advocated the use of informal reading inventories at the secondary level
([...], 1974). Teachers have been frequently encouraged to develop their
own informal assessment techniques. Teacher-constructed inventories have
been favored because a minimal amount of work has been done on published
inventories with secondary level passages or on inventories which meet the
specialized needs of content areas. In an effort to aid teachers with this task,
[...] articles have been written describing effective questioning procedures and
[...] (Lucking, 1975). Studies have been undertaken
to improve the training of teachers in questioning techniques (Rogers and
[...]
teachers to develop their own IRIs will be critically examined.

Method 

The informal reading inventory used in the study was developed to provide a
reasonably quick yet effective means of assessing the reading performance
of secondary students individually or as a group [...]
350 words in length. Each of the passages was evaluated by means of the Fry
[...]. An effort was made to select passages on the basis of [...]
inference, one speculation, and one application question.



The passages and questions were administered to seventh, ninth, and
eleventh grade students in an upper middle class suburban community
located near a large city. The inventory was administered as a group task by
the classroom teachers. All inventories were scored by a single individual.

A three factor design with two between-subjects variables, grade and
passage, and one within-subjects variable, question types, was used in the
study. Each student read passages at one grade level.

Results

The number of correct responses for each set of the teacher-constructed
questions was tabulated and then subjected to a three-way analysis of
variance. Significant effects were found for passage, F(4, 165) = 5.81, p <
.01, and question type, F(3, 495) = 20.18, p < .001, and for interactions
between grade and passages, F(8, 165) = [...], p < .05, and passages and
question types, F(12, 495) = 3.87, p < .0[...]. A significant effect was not found
for the grade variable. However, linear trend analyses for grade and passage
indicated the expected progression across grade levels and across passages.
Tests carried out using the Newman-Keuls procedure on the question variable
indicated vocabulary questions were significantly more difficult than other
question types.
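The reported degrees of freedom are consistent with a mixed design of three grades by five passage levels (between subjects) by four question types (within subjects) with 180 students. The three grades are stated in the text; the five passage levels, four question types, and 180 students are inferred from the degrees of freedom and are assumptions of this sketch, as is the function name `mixed_design_df`.

```python
def mixed_design_df(n_grades, n_passages, n_question_types, n_subjects):
    """Degrees of freedom for a two-between, one-within factorial design.

    Only the terms whose df appear in the reported F ratios are returned.
    """
    between_error = n_subjects - n_grades * n_passages
    q = n_question_types - 1
    return {
        "grade": n_grades - 1,
        "passage": n_passages - 1,
        "grade x passage": (n_grades - 1) * (n_passages - 1),
        "between-subjects error": between_error,
        "question type": q,
        "passage x question type": (n_passages - 1) * q,
        "within-subjects error": between_error * q,
    }


if __name__ == "__main__":
    # Inferred, not stated: 5 passage levels, 4 question types, 180 students.
    for term, df in mixed_design_df(3, 5, 4, 180).items():
        print(f"{term:26s} {df}")
```

With these inputs the error terms come out to 165 and 495, matching the denominators of the F ratios reported above.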

The questions were next evaluated by use of a discrimination and difficulty
index program (Linden & Mazzucca, 1977). The difficulty index was identified
as the percentage of subjects who answered the questions correctly.
Questions were subsequently labeled as easy, average, or hard. The
discrimination index was identified as the ability of the questions to
discriminate between high and low scoring subjects and represented the
percentage of maximum possible discriminations obtained. Questions were
identified with ratings varying in range from negatively discriminating to highly
discriminating.
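The two indices can be illustrated with a minimal sketch. The difficulty index follows the definition given above. The discrimination index shown is the familiar upper-lower group difference, swapped in for illustration; it is not the "percentage of maximum possible discriminations" computed by the Linden and Mazzucca program, whose details are not given here. The function names and the 27% group fraction are assumptions.

```python
def difficulty_index(item_scores):
    """Percentage of subjects answering the item correctly (1 = correct)."""
    return 100.0 * sum(item_scores) / len(item_scores)


def discrimination_index(item_scores, total_scores, fraction=0.27):
    """Upper-lower discrimination index for one item.

    Compares the proportion correct in the highest-scoring group with the
    proportion correct in the lowest-scoring group, where groups are formed
    from total test scores. Values range from -1.0 to 1.0.
    """
    n = len(item_scores)
    k = max(1, round(n * fraction))  # size of each extreme group
    order = sorted(range(n), key=lambda i: total_scores[i])
    low, high = order[:k], order[-k:]
    p_high = sum(item_scores[i] for i in high) / k
    p_low = sum(item_scores[i] for i in low) / k
    return p_high - p_low
```

A negative value corresponds to a negatively discriminating question, one that low scorers answer correctly more often than high scorers; a value near zero corresponds to a nondiscriminating question.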

The results of the difficulty and discrimination index program were used to
determine whether or not the questions had been effective with subjects at the
grade level they were intended for. If exhibiting appropriate difficulty at grade
level, questions were expected to consist of a large portion of average items
with a smaller balance of easy and hard items. The results of this analysis
indicated an acceptable balance of difficulty for the inventory questions. For
seventh grade level the results showed three easy, six average, and one hard
question. The ninth and eleventh grade results showed one easy, eight
average, and one hard question. If discriminating effectively, questions were
expected to lie within the plus one to plus four discriminating range, preferably
at plus two or above. The program results indicated that while there were no
negatively discriminating items, the bulk of questions were nondiscriminating
or low discriminating items with a plus one rating.

The results of the program were also used to determine whether or not
logical progression in terms of difficulty and discrimination existed across
grade levels. It was expected, for example, that questions accompanying fifth
grade level passages would be easy and nondiscriminating for the seventh,
ninth and eleventh grade subjects. The program results indicated the
questions were of appropriate difficulty across grade levels but were
inadequate or inconsistent in progression and levels of discrimination.
Discussion



The analysis of variance results suggest that as a whole, the set of inventory
questions operates appropriately by [illegible] among the subjects and the
graded passages and question types. [illegible] other evaluative procedures
suggest the questions are not well constructed. The Newman-Keuls tests indicate
teachers have difficulty developing appropriate vocabulary questions. Although
the difficulty level of questions appears satisfactory both for intended grade
level and across grade levels, the inventory questions often do not discriminate
between high and low scoring subjects. Certainly only questionable use can be
made of an informal reading inventory if its items cannot discriminate between
good and poor readers.

The results of this study seem to suggest [illegible] encouraging
teacher-constructed secondary reading inventories. Preservice and inservice
training programs need to focus on [illegible] teachers in constructing and
evaluating questions. More thorough instruction in the use, scoring, and
interpretation of informal reading inventories is needed. Teachers must be
repeatedly cautioned to use such results in conjunction with other available
pupil information. Encouragement to create informal reading inventories should
only be given when adequate preparation [illegible].

[illegible] trained professional, teacher-constructed assessment questions can
work effectively. However, it seems possible that inappropriate assumptions
have been made in encouraging secondary teachers to construct IRIs.
Observed patterns of teacher-pupil classroom behavior as
predictors of student growth in reading

This paper explores the relationships between observed classroom behavior
and student achievement in reading. In two earlier studies, the investigators
reported significant relationships between classroom mean student coping style,
teacher control, and reading achievement (Coker and Lorentz, 1975) as well as
between individual student coping style and reading achievement (Coker and
Lorentz, 1976). The present paper is a further examination of data collected
in 1974-76 as part of the Carroll County CBTC Project. It is the purpose of
the present study to examine the relationship between observed classroom
behavior, specifically teacher-pupil interactions, and reading achievement.

It was hypothesized that certain teacher-pupil interactions, as recorded by
direct observation using the Spaulding Teacher Activity Rating Schedule,
would be significant predictors of end of year reading achievement.

Method

Subjects

The participants in the present study, 41 classroom teachers, grades 3-8,
are a subset of the population of 103 teachers who participated in the CBTC
Project during the 1974-76 school years (1974-75 sample = 26 classes,
1975-76 sample = 15 additional classes). Only those teachers who taught
reading as a major subject are included in the sample selected for the present
study.

An estimate of the socioeconomic status (SES) was obtained for each
pupil based upon the occupation of the family's principal bread-winner. This
occupational information was transformed to a 1-8 scale (1 = high status,
8 = low status) using a modified version of the Warner, Meeker, and Eells (1960)
classification of occupations and levels. Pupils with missing SES were assigned
the mean for their classroom, rounded to the nearest whole number.

The Iowa Tests of Basic Skills, Forms 5 and 6, Levels Edition (ITBS)
(Hieronymus and Lindquist, 1971) was given to all pupils in grades 3 through
8 who were involved in the CBTC Project. The Reading Comprehension
subtest only was used in the present study. The reported reliability and
validity for the ITBS are adequate for use in the present study. In addition,
the State of Georgia used the ITBS in the 4th and 8th grades in its
[illegible].

The Spaulding Teacher Activity Rating Schedule (STARS) (Spaulding, 1975)
is a category system which examines the cognitive instructional strategies of
teachers as well as their affective and control techniques. STARS consists of
25 categories of teacher behavior which are subsumed under the subtitles
of affective behavior, motor and social structuring, concept attainment, concept
checking and value expression, as they interact with 19 categories of student
behavior which are identified by descriptive statements such as "Aggressive
Behavior", "Self-directed Activity", "Observing Passively", etc. STARS has been
developed over a period of ten years and is currently being used in a number
of studies.

The initial reading tests were administered early in the fall in the classrooms
of the 41 Carroll County teachers, and posttests were administered in late
spring in the same classrooms. A group of specially-trained testers administered
the tests in each classroom.

After the pretests had been administered, each of the approximately
1200 pupils in the 41 classes was observed on two separate visits during a
three-week period by observers using the CASES observation instrument
(Spaulding, 1970). Each pupil's record was scored to identify a predominant
coping style, such as Aggressive, Passive, Task-oriented, etc.

After the pretests had been administered and the CASES scored, six
pupils with different predominant coping styles were randomly selected from
each classroom. These six students were used in subsequent observations
using STARS. Six such observations were made in each classroom during the
remainder of the school year.

Visits were scheduled in advance, but no attempt was made to preselect
the activities to be observed, since the intent was to record a
representative sample of the behavior present in each classroom.

All observations were made by experienced classroom teachers who had
been employed full-time by the CBTC Project as observers and had been
personally trained by the author of CASES and STARS, Dr. Robert Spaulding.
Observer agreement studies, conducted in non-project classrooms, confirmed
that the observers were consistently able to maintain an agreement of 80
percent or better.

Tests were hand or machine scored, keypunched and verified. Observation
data were keypunched from the data collection forms and verified.

ITBS scores were converted from raw scores to Standard Scores using
the published Standard Score Tables (Houghton Mifflin, 1973). These scores
used a normalized standard score scale with a mean of 80 and a standard
deviation of 20 for the entire grade range, 3-8 data in a single analysis.

STARS frequencies for each pupil were combined using Spaulding's
procedure (Spaulding, personal communication) to identify [illegible].
These are derived from combinations of behaviors recorded in the 475
cell (i.e., 25 x 19) student/teacher interaction matrix and labeled with titles
descriptive of individual teaching behaviors.

The pretest and posttest reading scores, a SES measure, grade level, and
twenty-one STARS scores were the variables used.
Two additional variables were generated for each student. These were Pre x
Grade and Pretest Squared. Pre x Grade was generated because the
correlations of pretest and posttest tend to increase at higher grade levels and
this term permits fitting different regression slopes at different grade levels.
The Pretest Squared term permits fitting a two degree curve to the relation
between pretest and posttest, and tests if a nonlinear fit is significantly
better.

To examine the relationships between the STARS scores and reading
achievement, two stepwise multiple regression analyses were carried out
using the SPSS (Nie et al., 1975) regression procedure.
The dependent variable in both analyses was posttest. In the first analysis,
the independent variables were forced to enter the equation in the following
order: pretest, grade, SES, Pre x Grade, pretest squared, and finally STARS
1 through STARS 21. In the second analysis, pretest was forced to enter
first and the remaining variables were permitted to enter in the order in which
they accounted for additional proportions of the variance.
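
The role of the two generated variables can be shown with a short sketch. It assumes a regression equation of the form posttest = b0 + b1(pretest) + b2(grade) + b3(Pre x Grade) + b4(Pretest Squared) + other terms; the coefficient names and the numeric values in the example are hypothetical and are not taken from the study.

```python
def derived_terms(pretest, grade):
    """Return the two generated predictors for one student."""
    return {
        "pre_x_grade": pretest * grade,      # lets the pretest slope vary by grade
        "pretest_squared": pretest ** 2,     # allows a second-degree (curved) fit
    }


def pretest_slope(b_pre, b_pre_x_grade, b_pre_sq, pretest, grade):
    """Slope of predicted posttest with respect to pretest.

    Under the assumed equation, the slope is
    b1 + b3 * grade + 2 * b4 * pretest, so it differs across grade
    levels whenever b3 is nonzero and across pretest levels whenever
    b4 is nonzero.
    """
    return b_pre + b_pre_x_grade * grade + 2 * b_pre_sq * pretest


if __name__ == "__main__":
    # Hypothetical student: standard score 80 in grade 5.
    print(derived_terms(80, 5))
    # Hypothetical coefficients, with no curvature term.
    print(pretest_slope(0.5, 0.05, 0.0, pretest=80, grade=4))
```

With the curvature coefficient set to zero, the sketch reduces to a separate straight-line slope for each grade, which is the purpose the text gives for the Pre x Grade term.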

Results 

As predicted, pretest is the best predictor of posttest reading (r = .81),
accounting for nearly two-thirds of the variance. Several STARS scores
were better predictors of posttest reading than were grade and SES. Therefore,
the second analysis, in which all variables were permitted to enter the
equation depending on their contribution, is considered in the following
discussion. Table 1 presents the variables in their order of entry into the
equation.

Table 1

Multiple Regression Analysis

Variable                      Multiple R   Simple R   F to Enter

Pre-Reading                      .81          .81       376.18
21. Structuring Supervisor       .82         -.23        10.01
Pre x Grade                      .83          .78         8.46
19. Supportive Guide             .83         -.20         6.48
SES                              .84         -.27         5.41
20. Egocentric Instructor        .84          .32         4.65
 2. Boring Lecturer              .84          .15         2.77
17. Style EFG Treatment          .85         -.03         2.32
10. Effective Manager            .85         -.01         5.09
13. Style A Treatment            .85          .02         1.42
Pretest Squared                  .85          .81         1.07
Grade                            .85          .64         3.21
 3. Examiner                     .86         -.18         1.64
 6. Counselor                    .86         -.07         1.48
 4. Entertainer                  .86          .32         0.70
12. Rote Process                 .86         -.16         0.33
 8. Discovery                    .86         -.12         0.46
 9. Socratic                     .86          .06         0.62
15. Style C Treatment            .86         -.09         0.20
 1. Effective Storyteller        .86          .19         0.19
11. Expository                   .86          .08         0.27
 5. Controller                   .86         -.07         0.09
 7. Pseudo Peer                  .86          .15         0.09
14. Style B Treatment            .86         -.03         0.02
18. Responsive Explainer         .86          .07         0.01

*** p < .001
 ** p < .01
  * p < .05
Discussion



One unexpected result was that several of the STARS scores were better
predictors of reading achievement than were either Grade Level or SES. Another
surprising finding was the negative relationship between reading and teacher/pupil
interactions judged to be "good", such as Structuring Supervisor and Supportive
Guide. Also unpredicted was the positive relationship between Reading
Achievement and Egocentric Instructor.

The lack of significant contribution of other "good" STARS scores such as
Discovery Teacher and Socratic Teacher is probably due to the low frequency
of these behaviors in the Carroll County data.

These findings are in general agreement with other studies. They suggest
that while the best indicator of a student's growth in reading is his pretest
score, observed behaviors of both students and teachers may be useful
predictors of achievement in reading.
Teacher inferences of the characteristics
of non-standard speaking readers



Early attempts to teach reading to non-standard Black Dialect speakers
were based for the most part on the assumption that the language used by
these children was deficient or incomplete in development. This was due, it was
proposed, to the culturally deprived environment from which these children
came; a lack of early stimulation resulted in inadequate speech and retarded
cognitive development and subsequent failure in school, especially in learning to
read.
Challenges to this deficit model of language development began with
McDavid's (1964) plea for teacher acceptance of a concept of Black Dialect
as a different language system rather than a deficient, underdeveloped
version of Standard English. According to most sociolinguists (e.g., Baratz, 1969),
Black Dialects are well-formed, highly developed linguistic systems with
phonological and grammatical rules of their own. While different from Standard
English, the dialects are not deformed or deficient versions of Standard English.

The deficit model "easily" explained reading failure as lack of language and
cognitive development, and recommended remediation in language, i.e.,
standardization from "amalgamated noises" to Standard English phonology and
grammar (Bereiter and Engelmann, 1966). The difference model explained
reading failure as a "mismatch" between the linguistic system of the Black
Dialect speaking child and the linguistic system represented in beginning
reading materials (Goodman, 1969).

The mismatch hypothesis led to a number of research projects conducted
to assess the effects of dialect difference on reading comprehension and on
learning to read (Nolen, 1972; Sims, 1972; Baratz, 1973; Melmed, 1973;
Hockman, 1974). Representative of this research is a study by Simons and
Johnson (1974) in which it was postulated that Black Dialect speaking children
would comprehend passages written in Black Dialect better than they would
comprehend passages written in Standard English. Black Dialect versions of
Standard English materials were written and presented to Black Dialect
speaking children. No evidence was found to support this version of the
dialect interference theory.

Findings such as this have led to other explanations of the reading difficulty
many children experience. One of these is the argument that a potential cause
of the reading failure of nonstandard speaking children is a tendency of
teachers to force word-for-word accuracy in oral reading, rather than viewing
reading as a meaning getting process (Goodman and Buck, 1973; Cunningham,
1975). The latter conception of reading would require a teacher to distinguish
between meaning versus non-meaning changing miscues or deviations. The
distinction is important since Goodman and Sims (1974) report that "Black
Dialect speakers frequently read Standard English structures orally as Black
Dialect structures" and that "changes made by the subjects were surface
changes (and) retained the meaning of the original sentences" (p. 838).

Previous research findings suggest that teachers consider the speakers of
nonstandard dialects to be less adequate in their speech than speakers of
Standard English ([illegible], 1971; Whitehead and Miller, [illegible])
[illegible] correlations between ratings of speech and predictions of academic
scores, intelligence, and academic readiness.

There is from the research, then, a suggestion that teachers' response to
[illegible] those [illegible] in reading instruction, manifesting in what Goodman
and Buck (1973) have termed linguistic discrimination.

The empirical evidence for such a conclusion is, however, sketchy. It
was the purpose of the present study to determine whether or not the presence
[illegible]. If so, is this equally true for good and poor readers?

Method

Subjects

The subjects were 84 students enrolled in three different sections of a
graduate course on methods and materials for reading in the elementary
[illegible] reading.

[illegible] a story by Clark (1966) [illegible] Standard English poor reader
(SEPR) and Black Dialect poor reader (BDPR). Each variation of the text
contained fifteen changes from the original [illegible] meaning-changing miscues
(MCM) and non-meaning-changing miscues (NMCM). Good reader versions (SEGR and
BDGR) and poor reader versions were prepared. The good reader versions (SEGR
and BDGR) contained five MCM and ten NMCM, while the poor reader versions (SEPR
and BDPR) contained ten MCM and five NMCM. Of the NMCM, all versions shared
five, with the second five for SEGR typical of a Standard English speaking good
reader's miscues and the second five for BDGR indicative of Black Dialect
syntactic features (e.g., double negation, progressive be). [illegible] recorded
by adult [illegible] each reader type present in each set but with order
[illegible] same [illegible] version and poor reader version as read by the same
adult. Individual reader speech features, other than those controlled through
MCM and NMCM, were not controlled.

Table 1
Miscues Across Versions

                        Good Reader            Poor Reader

Standard English        MCM = 5¹               MCM = 5¹
                        NMCM = 5²              NMCM = 5²
                        NMCM = 5²              MCM = 5⁴
                        Total Miscues = 15⁵    Total Miscues = 15⁵

Black Dialect           MCM = 5¹               MCM = 5¹
                        NMCM = 5³              NMCM = 5³
                        NMCM = 5³              MCM = 5⁴
                        Total Miscues = 15⁵    Total Miscues = 15⁵

¹ Meaning-changing miscues: shared miscues, all versions.
² Non-meaning-changing miscues "typical" of Standard English speakers.
³ Non-meaning-changing miscues reflecting syntactic patterns of Black Dialect.
⁴ Meaning-changing miscues, "poor" reader version.
⁵ Total miscues for 255-word passage.

Procedure

The following scenario was established: subjects were told that the
investigators were interested in exploring the factors that shape a person's
diagnosis of a reader, and that by listening to tapes of readers, we were
approximating a typical first step in classroom reading evaluation. Each subject
was given four copies of the original story text and was told that (s)he
would hear four children, each reading the material. Subjects were told to
make whatever notes they felt necessary on the copies in order to rate and
rank the readers.

The task for all the subjects was the same. On the first day of class, each
subject listened to a tape recording of four female "children" reading the same
255-word selection. After hearing each reader, the subject completed a 10-item,
five-point bi-polar adjective scale to rate the reader. Examples of dimensions
on the scale are: the reader uses proper phrasing ... does not use proper
phrasing; is confident ... is unsure; seems to read with meaning ... is a word
caller. After a subject heard all four readers, (s)he was asked to rank order the
four readers with the best reader ranked first, and the poorest reader ranked
fourth.

Results

[illegible] SEGR, SEPR, BDGR, and BDPR readers. In addition, each subject rank
ordered the four readers (s)he heard. Table 2 contains the means and standard
deviations of the summated rating scores and the ranking scores for all
subjects for the four reader types.

Table 2

Means and Standard Deviations: Rating and Ranking Scores

                                   Ratings¹          Rankings²
                                   M       SD        M       SD

Standard English Good Reader     32.85    9.56      1.71    .939
Black Dialect Good Reader        27.56    6.94      2.28    .989
Standard English Poor Reader     23.21    7.78      2.96    .884
Black Dialect Poor Reader        23.45    7.86      3.03    1.11

¹ Rating scores could range from 10 to 50, summation of ten 5-point items.
² Ranking scores range from 1 to 4 (with 1 being the most positive and
4 the least positive).

[illegible] obtained between readers for both the ratings, χ² (3) = 57.44,
p < .05, and [illegible] comparisons of both the rating and ranking data; good
readers as a group were ranked more positively than poor readers. The SEPR and
BDPR readers were not rated or ranked differently from each other. However, the
SEGR and BDGR readers were rated and ranked differently, with the SEGR reader
evaluated more positively.

A two-by-two multivariate analysis of variance was also done with the
dependent variables being the four summated ratings of the four readers, and
the independent variables being race of subject (Black vs. White) and whether
or not a person had taught reading. No significant differences in rating behavior
were found due to either of these factors or their interaction.

Conclusions

The results of this study suggest that teachers may confuse oral language
features of Black Dialect speaking readers with reading ability. Black Dialect
non-meaning-changing syntactic features embedded in the oral reading of
otherwise good readers appear to have been salient in teachers' judgements
of these readers. Support for this conclusion is found in both the rating
scale data and the ranking data; Black Dialect speaking good readers were
rated and ranked significantly lower than their Standard English speaking
counterparts.

It is interesting that this finding did not hold for poor readers; BDPR and SEPR
ratings and rankings were not significantly different. An interpretation of these
differing results is possible if it is assumed subjects were "looking for mistakes."

The poor reader tapes contained more meaning-changing miscues than did
the good reader tapes. The dialect-based miscues may have only affected
judgements when these miscues co-occurred with a relatively small number of
meaning-changing miscues. But when surrounded by a large number of
meaning-changing miscues, the dialect-based miscues may be less obvious and
their impact on judgements lessened.

These findings provide some empirical support for the hypothesis suggested
by Goodman and Buck (1973) and Simons and Johnson (1974) that it is
teacher response to language different children, not dialect difference per se,
that may be related to many of these children's failure to learn to read.
If teachers look for errors in the oral reading of children and consider syntactic
features of the oral language of nonstandard speaking children as errors,
then these non-standard speaking children may be judged as poor readers.
As Goodman and Buck suggest, such a judgement may result in teachers
moving children away from their linguistic competence by requiring word-for-word
accuracy.

There are several implications suggested by these findings. First, if teacher
judgements about the reading abilities of nonstandard speaking children are a
result of lack of knowledge of the features of nonstandard dialects, then
teacher education must provide such information. Protocol materials by Love
(1973) have been helpful in promoting such knowledge (Ramig, Granger & Neel,
1976).

However, if judgements about the reading ability of nonstandard speaking
students are a result of a generalized stereotypic response to nonstandard
dialects, then perhaps knowledge of the features of nonstandard dialects is not
sufficient. Techniques for modifying such stereotypic expectations have been
discussed by Billiard, Elifson and Rubadeau (1976).

Finally, if subjects' responses to the oral reading of children result from an
attitude that only word-for-word precision is good reading, then study of
psycholinguistic conceptualizations of the reading process is probably appropriate.

Teacher education ought to provide opportunities for teachers not only to
learn about language and language differences as they relate to reading, but
also to develop humane and positive attitudes toward language different
children. Given appropriate learning, teachers may be able to distinguish
reading ability from related but non-essential factors.
The effects of differing materials on the reading process

The variability of the reading process across differing materials merits
careful study, since understanding of the nature and degree of such variability
has direct impact upon teaching strategies. Although discrete reading
processes for differing materials have been proposed (Gibson & Levin, 1975;
[illegible], 1975), basic research is warranted to describe the interaction of
readers with such materials (MacGinitie, 1975-76).

There are five systems that cue meaning in reading: (1) cue systems within
words, (2) cue systems within the flow of language at the sentence level
(Smith, Goodman, & Meredith, 1970), (3) cue systems within the flow of
connected discourse, including intersentence grammar, semantic
redundancy beyond the sentence level, and organizational structure
characteristic of various kinds of prose (Neuwirth, 1976; Meyer, 1976), (4) cue
systems within the reader, including the experiential background and
conceptual abilities, scripts, and/or schemata (Pearson & Nicholson, 1976;
Anderson, Reynolds, Schallert, & Goetz, 1976; Schank, 1972), and (5) cue
systems external to both language and the reader, such as pictures, charts,
[illegible]. It should be noted that not all cue systems are available at every
[illegible].

Other factors may also affect cue system utilization. Barr (1975) and Harste
and Burke (1976) have suggested that the focus of reading instruction greatly
affects cue utilization and hence reader strategies for dealing with print. This
effect was [illegible] noted for poor readers, with good readers seemingly
[illegible].

[illegible] narrative may also be significant to the
reader. Literary narrative may be described as more emotive in tone
(Wheelwright, 1954) and more abstract (Moffett & Wagner, 1976) than
historical narrative; the possible effects of these differences upon the reading
process have not as yet been examined in research.

The present studies involved in-depth investigations of cue system
utilization by three groups of readers: second-grade students, ninth-grade
students, and mature adults past the age of sixty. Second-grade readers read
material which varied according to organizational structure, while ninth-grade
and mature adult readers read literary and historical narratives. The
studies yielded a view of striking similarities in cue system utilization across
wide age variations.



Method 



Study I. Six readers from one second-grade classroom were involved in this
study. The teacher was asked to rank order all the children in her classroom.
Children who had the same kindergarten and first grade teachers were
identified within this ranking and a stratified random sample was drawn,
consisting of two children with similar instructional histories from the top,
middle, and lower one-third of the ranking.

Study II. Six ninth-grade English students who were considered proficient
readers were selected. The teacher was asked to identify and rank-order her
top ten readers and the top six were chosen. Although it was not possible to
precisely reconstruct the instructional histories of these readers, an interview
concerning their perceptions of the reading process and preferred reading
strategies showed that they generally shared the same set of views.

Study III. Four non-institutionalized persons, ages 62, 64, 65, and 82, who
were rated as mature readers (Gray & Rogers, 1956) were selected for this
study. Data from an initial interview were examined by a panel familiar with the
Gray and Rogers (1956) scale, and this panel determined the reading maturity
of the study participants. The same interview format employed in Study II was
used to collect data concerning perceptions of reading and preferred reading
strategies. These data indicated that the four persons held similar views and that
these views tended to coincide with those of the readers in Study II.

Study I. Three selections were chosen for this study, all of which dealt with
turtles (Scott Foresman, Level 6, 1971). The amount of available prior
knowledge was therefore constant across the three selections. Stories were
analyzed for readability using the Spache formula and were found to be highly
similar at approximately 2.6. Materials were selected on the basis of
readability and because they represented typical formats found in social
studies materials, science materials, and literature. To verify selection
choices, copies of all three selections were given to one class of
undergraduate students in elementary education and each student was asked to
classify each selection according to discipline. "Turtle Rescue" was identified
by 83% of the students as a social studies selection, 100% identified "Kinds
of Turtles" as a science selection, and 93% identified "Clever Turtle" as a
literature selection.

Studies II and III. Two selections, both about the American Civil War, were
selected for Studies II and III (Crane, 1952; Brown, Robinson, & Cunningham,
1974); thus pertinent prior knowledge was held constant for the selections.
The Dale-Chall and Flesch readability formulas were used to determine that
the selections were comparable in terms of readability; both selections were
rated as 7th to 9th grade material by both formulas. Both narratives employed
a straightforward chronological pattern of organization. Neither provided
extralinguistic cues. The significant difference in the narratives was taken to
be that the literary selection is relatively more emotive and abstract than the
historical selection, which presents a more nonemotive and concrete
treatment of events.

Study I. Procedures described in the Reading Miscue Inventory (RMI)
Manual (Goodman & Burke, 1972) were used in collecting data from each of
the six readers. The three selections were read by each reader in a single
one-hour session, and the order of selection presentation was varied across
readers. Reading sessions were conducted by a single researcher in a
teacher's lounge.

Study II. Data collection procedures for this study are similar to those
described for Study I. Since the two selections used in this study are
considerably longer than those used in Study I, two reading sessions were
[illegible] for each reader; in the initial session, the reader was interviewed, and
read and retold one of the two selections. The order of selections read was
alternated across readers. All reading sessions were conducted in a quiet
[illegible].

Study III. The procedures used in this study are identical to those described
in Study II, except that all data-collection sessions were conducted by the
researcher in the homes of the readers.



Study I. Worksheets for each selection were prepared following procedures
as described in the RMI Manual. Standard miscue counting procedures were
followed, with the exception that partial corrections were counted as miscues
and coded as evidence of within-words cue utilization. The first 30 miscues
made by a subject (or all miscues when less than 30) on each of the three
selections were analyzed. Following this procedure produced 398 miscues for
analysis. Twenty-four questions were asked of each miscue.
Miscues were coded in terms of the reader's utilization of each of the five
cue systems. When there was evidence that cue systems within words had
been utilized, the miscue was coded as partial word (letter-sound relationships
or affixes) or whole known words (substitutions, omissions, or the production
of known words within the text word on which the miscue occurred). Use of
syntactic and semantic cues within the flow of language was coded
according to whether these cues were at the sentence level or the discourse
level. The reader's utilization of preferred oral language structures and lexical
item preferences was taken as evidence of his bringing his conceptual
understanding of language and the world to the task, and was coded within
cue systems within the reader. Extralinguistic cue system use
was noted on the worksheet at the time of taping, and coded as pictures and
other such cues as charts and graphs.
Ten additional questions coded which of the five cue systems had been
[illegible] they resulted in meaning change.
Meaning change was coded as Yes, No, or Partial, using decision rules
[illegible] utilization and effectiveness [illegible]
selected as the [illegible] permitted the analysis of miscues at
[illegible] sentence, and intersentence level and resulted in a minimal loss of
[illegible].
in, °n rn addmon to these analyses, one additional calculation was made _ This 
was rniscues per hundred words (MPHW), the ratio of m.scues made to the 
' Total number of words in the selection X 100. This procedure equalizes miscue 
counts across selections of various lengths. ' 
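
The MPHW ratio can be illustrated with a short Python sketch. The function name and the example counts are hypothetical, chosen only to show the arithmetic; they are not data from these studies.

```python
def miscues_per_hundred_words(miscue_count: int, word_count: int) -> float:
    """Return miscues per hundred words (MPHW): miscues / words x 100."""
    if word_count <= 0:
        raise ValueError("word_count must be positive")
    return 100 * miscue_count / word_count


# Hypothetical example: 30 miscues on a 400-word selection.
print(miscues_per_hundred_words(30, 400))  # 7.5
```

Because the count is scaled by selection length, a reader's rate on a short selection can be compared directly with the rate on a longer one.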

Studies II and III. Data analysis procedures were nearly identical for these
two studies and similar to those described for Study I. A total of 1,105 miscues
were analyzed in Study II and 268 miscues were analyzed in Study III. Reading
interview data were examined to permit comparison of described and actual
[...] retelling scores according to [...] as well as by [...] as
to whether it resulted in meaning change; and if so, whether this change was
minimal or substantial. To equalize miscue counts across selections of
different lengths, the number of miscues per hundred words (MPHW) was
calculated. Indices of the readers' tendency to recover from miscues which
disturb meaning were also constructed in these studies. In Study II, residual
miscues per hundred words (RMPHW, the ratio of miscues which disturb
meaning to the total number of words in the selection X 100) was used as such
an index. Study III used the percentage of residual miscue sentences (those
sentences in which an uncorrected miscue caused meaning change) as such
an index. Finally, in Study II the percentage of regressions made by each
reader at the morphemic or sub-morphemic level was calculated in an effort to
illustrate the tendency toward the use of cue systems within words.
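
The two recovery indices can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names, the boolean-per-sentence representation, and the example numbers are hypothetical and do not come from the studies.

```python
def residual_mphw(meaning_disturbing_miscues: int, word_count: int) -> float:
    """RMPHW: miscues that disturb meaning, per hundred words of text."""
    if word_count <= 0:
        raise ValueError("word_count must be positive")
    return 100 * meaning_disturbing_miscues / word_count


def percent_residual_sentences(sentence_flags: list) -> float:
    """Percentage of sentences holding an uncorrected, meaning-changing miscue.

    Each flag is True when the sentence contains such a miscue.
    """
    if not sentence_flags:
        raise ValueError("at least one sentence is required")
    return 100 * sum(sentence_flags) / len(sentence_flags)


# Hypothetical example values.
print(residual_mphw(12, 400))                                   # 3.0
print(percent_residual_sentences([True, False, False, False]))  # 25.0
```

Both indices count only the miscues that still disturb meaning after any correction, so a lower value indicates better recovery.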

Results and Discussion 

The balance of this report will present those conclusions common to all
three of the studies and samples from the data supporting them. Some
important implications which arise from these studies will also be noted.

Consistency of Cue System Utilization 

Readers in all three studies tended to use the same cue systems in a
consistent fashion regardless of the nature of the materials read. Ellen, a
second-grade subject in Study I (see Figure 1), made extensive and consistent
use of the within-words cue system on all three selections, relying much more
heavily on whole words than partial words. Vince was the most consistent user
of the various cue systems across the three selections (see Figure 2). These
two subjects utilized discourse cues, cues within the reader, and
extralinguistic cues much less frequently in the three selections. Ellen's use of
picture cues in 10% of her miscues on "Turtle Rescue" represents the
greatest use, for both these subjects, of information from these systems.

The pattern of consistent preference for word level cues across selections
also appears in the data from Study II (see Table 1). Debbie's miscues
exemplify the overall results. For the entire sample, substitutions tended to
bear high phonetic and graphic similarity to the text word on both of the
selections; regressions on both selections also tended to occur at the
morphemic and sub-morphemic levels. Relatively few miscues on either
selection resulted in fully acceptable semantic structures and few indicated
grammatical strength. Finally, the rate of miscue occurrence (MPHW) was
similar for both selections.

Study III produced similar results from the same selections used in Study II.
50% of Blanche's substitution miscues (see Table 2) on both of the selections
were highly similar to the text word in terms of both graphics and sound. Her
rate of miscues per hundred words (MPHW), while far lower than that of
subjects in Study II, was consistent across selections: 1.6 for the literary
narrative and 2.1 for the historical narrative. All of the subjects in Study III
produced a relatively high percentage of miscues on both selections (65%+)
that were fully acceptable syntactically and semantically. This would indicate
that while the older subjects were very consistent in cue system use across
differing materials, their preferences for word cues were neither as strong nor
as detrimental to the use of other cue systems as were such preferences
among the younger subjects.

The preference for word cues across samples and selections did appear
[...] use of cues from other systems and comprehending
effectiveness in all three studies. In the preceding data, it can be seen that the
[...] better comprehenders. These readers did, however, demonstrate consistency
in their patterns of cue system utilization.

Importance of the Reader's View of the Process

All subjects' reading performances tended to reflect their concepts of the
nature of the reading process more clearly than the differences in materials.
Vince and Ellen seemed to consistently view reading as a word-centered
process regardless of the material they were reading, while other, more
flexible readers in Study I were characterized by their strong and consistent
reliance upon sentence and discourse level cues, and their conceptual and
linguistic backgrounds. The interview data collected in Study II revealed that
all subjects in that sample, like Vince and Ellen, tended to view reading as a
process of accurate word recognition, and the oral reading data from this
sample, discussed previously, indicate that reading performances
were consistent, across both selections, with these stated perceptions of
reading. Interviews among the older subjects of Study III revealed a significant
concern for accuracy of word recognition, which was also apparent in their
oral reading performances on the two selections. The reader's view of the
process does seem to determine patterns of cue system use to a much
greater extent than the differences among materials from various content
areas.

Table 1

Cue Utilization and Effectiveness in Study II

                                      Literary Narrative         Historical Narrative

% of substitutions with
high phonetic similarity              [illegible]  [illegible]   66       65

% of substitutions with
high graphic similarity               67           [illegible]   71       [missing]

% of total regressions
occurring at morphemic
and submorphemic levels;
% of miscues resulting in
full semantic acceptability           [illegible], [illegible], 67, [illegible], 32, 40
                                      (values in source order; cell positions not recoverable)

% of miscues indicating
strengths in preserving
grammatical relationships;
% of miscues resulting in
meaning change                        97, [illegible], 40, 48, 68, [illegible]
                                      (values in source order; cell positions not recoverable)

miscues per hundred words
(MPHW)                                15.12        6.94          13.47    6.39

residual miscues per hundred
words (RMPHW)                         10.85        4.19          7.28     2.76

retelling score                       12           19            2        10



Results from these studies suggest that the reading process is relatively stable
across content areas, and that the widely held notion of differences
in the demands made by content materials (Piercey, 1976; Robinson, 1975;
Gibson & Levin, 1975; Herber, 1970) needs reexamination. However, strong
concern for accurate decoding was related to poor comprehending, even
among the high-school subjects and those past the age of sixty.

Whether or not the preference for attending to cues within words at the
expense of meaning is an artifact of instruction is an important question that
requires further study. The present studies, however, suggest a
need for instruction which encourages readers to apply their knowledge of
language and of the world in making greater use of all available cue systems,
and avoiding overreliance upon those cues which, inflexibly used, lead to poor
comprehension.



Table 2

Blanche's Cue System Use and Effectiveness (Study III)

                                      Literary      Historical
                                      Narrative     Narrative

% of substitutions with high
phonemic similarity                   50            50

% of substitutions with high
graphic similarity                    50            50

% of miscues indicating strength
in preserving grammatical
relationships                         76            57

% of sentences with miscues that
disturb meaning                       [missing]     [missing]

miscues per hundred words
(MPHW)                                1.6           2.1

retelling score                       19            [missing]



Effects of impulsivity-reflectivity
and type of phonics instruction on
reading achievement

Ausubel (1968) stated that no realistic system of teaching could afford to
overlook individual differences and that such differences were expressed in,
among other things, general mode of cognitive functions and approach to
problem-solving situations. The impulsivity-reflectivity dimension of cognitive
style describes one set of individual differences that affect thinking and
problem-solving behavior in human beings.

Kagan, Rosman, Day, Albert, and Phillips (1964) postulated that impulsivity-
reflectivity relates to the selection and evaluation of solution alternatives in
situations of high response uncertainty. When presented with an opportunity
to consider alternative solutions to problems, an impulsive child tends to act
upon his initial response with little reflection. Conversely, a reflective child
delays before carrying out a solution hypothesis and actively considers each
alternative.

Since the task of reading presents a situation of high response uncertainty,
cognitive style may have particular relevance for reading. To be successful in
reconstructing meaning from print, the reader must be a decision-maker who
selects appropriate linguistic cues, considers prior experiences, and applies
his language skill, all toward generating selective, differential hypotheses
(Goodman, 1967). Studies comparing impulsives and reflectives on the
variable of reading achievement (Kagan, 1965b; Johnson, 1969; Shapiro,
1974) have consistently demonstrated that reflectives perform better than
impulsives on tasks involving word recognition, reading comprehension, and
reading readiness. (For a complete review of the impulsivity-reflectivity
dimension of cognitive style, see Readence & Searfoss, 1976.)

There are two hypothetical explanations for the apparent fact that mastery
over the initial stages of reading acquisition is facilitated by reflective
behavior, i.e., thoughtful analysis and a minimum of guessing.
H1: The nature of the reading process is such that children are at an
advantage if they are reflective.
H2: Teachers and publishers organize reading instruction in ways
which tend to be advantageous for reflective children.
If H1 is true, it suggests that being reflective is, with respect to reading,
"better" than being impulsive and that if impulsive children are taught to be
more reflective they should also become improved readers. If H2 is correct,
then it implies that impulsives are merely "different" from reflectives rather
than deficient in some sense. It also suggests that the acquisition of reading
skills and strategies might be enhanced for both reflectives and impulsives
through manipulations of the learning environment, including instructional
[...].

Attempts at cognitive style modification (Debus, 1970; Denny, 1972;
Jacobs, 1974; Stein, 1969) have been moderately successful in getting
impulsive children to respond more reflectively; but these studies have been
unable to demonstrate that modifying impulsivity has [...]
any achievement variable. Consequently, [...]
be unfavorable to H1. On the other hand, [...] materials or
[...] intensive phonics
programs.

Original Analysis

Karlin (1975) described two general approaches to phonics instruction,
[...] and instruction proceeds from whole to part [...].

In the original study, vocabulary and reading comprehension scores of
impulsives and reflectives were compared in school programs using either
[...] apparently inclined to ignore.

Subjects. The subjects for the study were [...] one basal reading
[...].

Instruments. The Gates-MacGinitie [...]
personnel administered [...] reflective by the MFF were eliminated [...]
impulsives received [...] were taught by analytic phonics and 45 by synthetic
[...].

Results

There were no statistically significant differences in reading achievement
between subjects taught by the analytic and synthetic approaches. However,
reflectives exhibited significantly higher levels of achievement than
impulsives, F (2,165) = 3.55, p < .05. Significant differences also emerged in
the multivariate test for interaction between cognitive style and approach to
phonics, F (2,165) = 4.57, p < .05.

Univariate analyses for each dependent variable revealed that the
reflectives had significantly higher scores on both vocabulary (M = 39.48) and
comprehension (M = 26.30) than the impulsives (M = 37.21 and 23.79), F
(1,166) = 4.44, p < .05 and F (1,166) = 7.14, p < .01, respectively.

Since neither of the univariate interactions proved to be statistically
significant, the multivariate interaction was explained through separate
multivariate analyses which compared reflectives with impulsives in each of
the phonics approaches. The results indicated that in the synthetic approach
reflectives (M = 41.33) performed significantly better than impulsives (M =
37.54) only in vocabulary, F (1,93) = 7.76, p < .01. In the analytic approach,
reflectives (M = 25.72) had significantly higher scores than impulsives (M =
22.77) only in comprehension, F (1,73) = 3.98, p < .05.

Discussion 

The results seemed to support the notion that reflectives tend to be better
readers than impulsives, and the multivariate interaction involving cognitive
style and phonics approach was interpreted to mean that reflectives and
impulsives are affected differently by various types of reading programs.
However, the results provided no support for the hypothesis that impulsives
would find intensive synthetic phonics instruction particularly beneficial.

New Analysis

The original analysis of the multivariate interaction resulted in a
cumbersome set of statistics and no satisfactory means of visually
representing the interaction. The present analysis is an attempt to circumvent
those problems through the use of "difference scores" in a procedure by Levin
(1977). The analysis is based on the hypothesis that the achievement ratio
involving vocabulary and comprehension in the original study was dependent
upon cognitive style and instructional approach; that is, for a given cognitive
style group in a given phonics approach, vocabulary scores might be relatively
high while comprehension scores were relatively low; or the reverse could be
true.

Design

Vocabulary and comprehension subscores from the GMRT were
transformed to z scores. For each subject, the standardized comprehension
score was subtracted from the standardized vocabulary score. The resulting
difference score was then assigned to the appropriate cell in the 2 x 2 ANOVA
design. Main effects were approach to phonics and cognitive style, and
difference scores constituted the dependent measure.
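
The construction of the difference scores can be sketched in Python. This is an illustration under stated assumptions, not the authors' procedure in detail: the subject records are invented, the helper names are hypothetical, and the population standard deviation is used for standardizing because the paper does not say which form was applied.

```python
from collections import defaultdict
from statistics import mean, pstdev


def z_scores(values):
    """Standardize values to mean 0 and standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def difference_scores(vocabulary, comprehension):
    """Standardized vocabulary minus standardized comprehension, per subject."""
    return [zv - zc for zv, zc in zip(z_scores(vocabulary), z_scores(comprehension))]


# Hypothetical records: (cognitive style, phonics approach, vocabulary, comprehension).
subjects = [
    ("reflective", "analytic", 40, 29),
    ("reflective", "analytic", 38, 27),
    ("reflective", "synthetic", 44, 25),
    ("reflective", "synthetic", 42, 26),
    ("impulsive", "analytic", 39, 21),
    ("impulsive", "analytic", 37, 22),
    ("impulsive", "synthetic", 35, 25),
    ("impulsive", "synthetic", 36, 24),
]

diffs = difference_scores([s[2] for s in subjects], [s[3] for s in subjects])

# Assign each difference score to its cell in the 2 x 2 design.
cells = defaultdict(list)
for (style, approach, _, _), d in zip(subjects, diffs):
    cells[(style, approach)].append(d)

cell_means = {cell: mean(scores) for cell, scores in cells.items()}
for cell, m in sorted(cell_means.items()):
    print(cell, round(m, 3))
```

A positive cell mean indicates relatively stronger vocabulary than comprehension for that group, and a negative mean the reverse. Because both subscores are standardized over the whole sample, the difference scores average zero overall, so only the pattern across cells carries information.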

Results 

There Were no significant differences between reflectives (M - -.046) Bnd 
impulsives (M - .042), F (1,166) = 1. '00, p > .05; and there were no significant 
differences between subjects in the analytic schools (M - -.058) and those 
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taught by the synthetic approach (M = 045), F (1 166) = >^ D '" 
contrast, there was a significant disordmal interaction, F (1,166) - 8.82, p < 
.01 (see Figure 1) 
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Plotted Cell Means Showing Disordinal 
Interaction Between Cognitive style 
And Phonics Approach 
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Thlxuse of difference scores appears to be an efjcent Procedure for 
ascribing complex multivariate interactions. In this particular case, .t clearly 
toSertad that reflectives were relatively high achievers in reading 
com^eniSn bnder an analytic approach and relaUvely high ,n vocabulary 
when tauaht bv a synthetic approach to phonics. Impufsives had a reve sea 
^hievement patVrn They weVe relatively high in vocabulary in an analytic 
nrnaram and hiah in comprehension under a synthetic approach. 
P X7es^s are interp?eted to mean that childrer ; with different cogn, ,ve 
QtviP«i are sensitive to different instructional approaches in read ng. 
Fu fhermore the abhors find the results encouraging since it may be poss.b e 
to laSoT ou the characteristics of each instructional program which are 
5*5? beneficial to each cognitive style group. Once this is accomphshed 
SiShzed programs of instruction in reading could incorporate the best of 
eachlnstructiSnal approach in order to maximize readmg ach.evement for 
both reflective and impulsive children. 
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The development of orthographic sensitivity during the 
school year by primary grade children 

A number of theories of reading explain that knowledge of orthographic
structure is used by readers to help them process text efficiently (Gibson
and Levin, 1975; Smith, 1971). Although orthographic sensitivity is regarded
as important in the reading process, there is disagreement as to when this
sensitivity develops.

[...] feels that the ability to make use of orthographic structure
develops as early as first grade. Studies by Lott and Smith (1970), Niles (1976),
and Niles, Grunder, and Wimmer (1977) support this viewpoint. Lott and
Smith (1970) reported that first grade children were able to recognize letters
[...] when the letters were in words instead of in isolation. Niles
(1976) found that first grade children used less visual information to recognize
word-like pseudowords than non-wordlike pseudowords. Niles, Grunder, and
Wimmer (1977) found that end of the year first grade children performed above
chance on a task which involved selecting the more word-like pseudoword out
of pairs of zero-order and fourth-order pseudowords. Kindergarten children
performed at chance level on this task. From these studies it appears that
orthographic sensitivity begins to develop at the earliest stage of learning
to read.

[...] first grade children performed at chance level on a task of selecting the more
word-like pseudoword from pairs of pronounceable and unpronounceable
pseudowords. Third grade children, in contrast, performed at better than chance.
In a similar study Golinkoff (1974) found that students at the end of first
grade performed only slightly better than chance on a task of selecting the
more word-like pseudoword from pairs of visually presented pronounceable
and unpronounceable pseudowords. Students at the end of second grade
performed at a level significantly better than chance on this task. These studies
suggest that first grade children are not sensitive to orthographic structure
but students at the end of second grade or in third grade have developed
this sensitivity.

The studies cited above present an unclear picture of the development of
sensitivity to orthographic structure. Because the studies were based on different
cross-sections and tested for orthographic sensitivity on different
tasks at different times of the year, it is not surprising that there are conflicting
findings. An extensive study which is longitudinal as well as cross-sectional
would provide a more complete and accurate description of the development of
orthographic sensitivity in beginning readers. Therefore, in the present study
[...] examined over time on a task measuring orthographic
structure, an important text processing strategy, in the early stages of reading.
Additionally, this study will attempt to examine the relationship between reading
ability and sensitivity to orthographic structure.



Method 

Twenty first, second, and third grade children were randomly selected from
each of the three grades in a rural elementary school. There were about 35
students in each grade at this school. Ten sets of eight-letter approximations
to English (Miller, Bruner, and Postman, 1954), zero-order (hhjhufsw),
second-order (riprypli), and fourth-order (inforems), were presented individually
to the subjects in the second half of October, May, and the following October.
Attrition over the one-year period reduced the number in each group to 17. Each
subject was tested individually and asked to select the one pseudoword from a
set of three (zero, second, and fourth order) which looked most like a real word.
After making this first selection, the subject was then asked to select the
pseudoword from the remaining two in the set which looked most like a real word.
The set members as well as the ten sets were presented in random order. The
ranking responses were scored using a four point scale for the six possible
rankings. A chance score for the task was considered to be 25 and a ceiling
score obtained from a group of college-educated adults was 39.37. At the time
of the final testing the subjects were administered the reading comprehension
subtest of the Peabody Individual Achievement Test (Dunn and Markwardt,
1970). A 3 X 3 (grade X time) factorial design was used with repeated measures
on the second factor. Data were formally analyzed using an analysis of variance
procedure and the Newman-Keuls test for significant mean differences.
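
The point values assigned to the six possible rankings are not listed in the text. The Python sketch below shows one scheme that is consistent with the reported chance score of 25 over ten sets: one point, plus one point for each pair of pseudowords ranked in the expected order (fourth order above second order above zero order). This scheme is an assumption made for illustration, not the authors' documented scoring rule.

```python
from itertools import permutations

# Expected ranking, from most to least word-like.
ORDERS = ("fourth", "second", "zero")


def ranking_score(ranking):
    """Score one set: 1 point plus 1 for each pair ranked in the expected order."""
    position = {order: i for i, order in enumerate(ranking)}
    concordant = sum(
        position[a] < position[b]
        for i, a in enumerate(ORDERS)
        for b in ORDERS[i + 1:]
    )
    return 1 + concordant


# Scores for all six possible rankings of a three-member set.
scores = sorted(ranking_score(r) for r in permutations(ORDERS))
chance_per_set = sum(scores) / len(scores)

print(scores)               # [1, 2, 2, 3, 3, 4]
print(chance_per_set * 10)  # 25.0
```

Under this assumed scheme a fully correct ranking of all ten sets would score 40, which fits with the adult ceiling of 39.37 reported above.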

Results

Table 1 presents a summary of the means and standard deviations for
grades 1, 2, and 3 by the specific time of the measurement. An examination of
the means within each grade level suggests a developmental trend in the
acquisition of sensitivity to orthographic structure. The age-related nature of
orthographic knowledge is reflected in the overall means for each grade: 28.38
for first, 33.5 for second, and 34.56 for third. A formal analysis of this data
revealed no significant interaction between variables and a significant main
effect for both grade level and time, F(2,45) = 18.87, p < .01 and F(2,45) = 19.93,
p < .01.

The Newman-Keuls test revealed a number of significant differences among
means. Significant differences were p < .05. While the within grade means
increased on each subsequent measurement, only the first grade means all
differed significantly from each other. Second grade means differed significantly
from Fall 1 to Spring but not from Spring to Fall 2 measurement. Across grades,
the first grade means were significantly different from all other means in
every case with the exception of Fall 2 to second grade Fall 1. Fall 1 for
second grade was the only second grade mean to reflect a difference when
[...] 25. A t test confirmed that the Fall 1 [...]
compared with the chance score (p < .01).

The Pearson correlations between reading ability (the comprehension subtest
of the Peabody Individual Achievement Test) and the orthographic structure
task were .66, .49, and .60 for the Fall 2 scores of the first, second, and third
grades respectively and .68 for all grades.

Table 1

Means and Standard Deviations of Ranking
Responses by Grade and Time

                          Grade
Time               1         2         3

Fall 1     M       26.19     31.38     33.5
           SD      3.59      4.53      3.09

Spring     M       27.94     34.19     34.31
           SD      4.0       3.28      4.28

Fall 2     M       31.0      34.94     35.88
           SD      3.75      2.96      1.86

Total      M       28.38     33.5      34.56



Discussion 



These findings clearly support the notion that, in general, sensitivity to
orthographic structure is acquired by readers during the latter part of first
grade, even though the differences among cell means suggest that the growth
of knowledge of orthographic structure was rapid throughout first grade. All
conditions for the first grade were significantly different from each other. The
second grade children demonstrated significant growth from the fall measurement
to the spring measurement. At this point acquisition apparently slowed, as
there was no significant difference between the end of the year second grade
children and the final test on them when they were beginning third grade.
The acquisition rate slowed even more for the third grade children as they
showed no significant growth from the Fall 1 to the Spring measurement.
However, the third grade children did reflect significant growth when the
Fall 1 measurement was compared with the Fall 2 measurement.

[...] argues against the notion that sensitivity
to orthographic structure does not begin to develop until the end of the second
grade. For this task, the growth rate for the acquisition of orthographic
knowledge begins to slow down toward the end of second and beginning of
third grade. It would seem that with the onset of learning to read, which
generally occurs in first grade, the reader begins to rapidly develop orthographic
knowledge [...] to develop fluency in reading. Toward the end of
the third grade, the acquisition curve for orthographic sensitivity, at least for
this task, begins to gradually move toward an adult performance level.

Most likely the marked difference in findings between this study and others
that document the onset of sensitivity to orthographic structure at a later point
is related to the type of stimuli used. While eight-letter pseudowords were
used in the present task, shorter stimuli were used in other investigations.
For example, Rosinski and Wheeler (1972) used three, four, five and six letter
pseudowords and Gibson et al. (1963) used four and five letter pseudowords.
It is quite possible that the stimuli in these studies were not sensitive enough
for the first and second grade students to demonstrate their orthographic
knowledge. The longer stimuli may have provided the additional information
which enabled the first and second grade students to make more accurate
orthographic judgments.

The relationship between reading ability and the orthographic task also
supports the finding of early development of sensitivity to orthographic
structure. If reading ability and the task are correlated, it seems that one
might logically expect to find evidence of orthographic knowledge at almost
any point at which reading ability is discernible. While beginning readers may
not be as knowledgeable about orthography as fluent readers, it is unreasonable
to assume that this knowledge base is as slow in developing as other researchers
have suggested. Knowledge of the orthography helps make reading easier and
the inherent tendency of the human information processor seems to be to
make the perceptual process as efficient as possible.



Experiments in word learning

Many significant models of beginning and mature reading and their featural
components appear in the [...]. These
models are generally discussed in professional journals and
[...], yet they are not usually widely researched. Often it appears that
only the primary authors or their closest students do a systematic evaluation
of the proposed model. This is not an indictment, but rather a conclusion that
led [...].

Gibson and Levin [...] a complex of features [...]
composed of five classes (graphic,
phonological, orthographic, semantic, and syntactic, p. 194). These features,
according to Gibson and Levin, may or may not be extracted in the perceptual
[...].

The purpose [...], therefore, is to determine on what basis beginning readers
[...]. More specifically, the purposes of
[...]:

Do beginning readers respond differentially in the graphic
domain to the visual memory of words for the graphic features of lower- and
upper-case script?

Do beginning readers respond differentially in the phonological
domain to phonologically pronounceable words and phonologically
unpronounceable words?

Do beginning readers respond differentially in the
orthographic domain to orthographically legal words and orthographically
illegal words?

Do beginning readers respond differentially in the semantic
domain to concrete and abstract words?

Do beginning readers respond differentially in the syntactic
domain to nouns and verbs?

Sample

[...] Cognitive Abilities Test. Finally, the results of the two screening tests were
collapsed and 10 students having both average or above readiness and
intelligence levels were randomly selected from each school for the final
experimental number of 20. Accordingly, the students could be predicted as
possible successful learners based upon the two screening tests.

Experiments 

Experiment 1 

Experiment 1 was designed to investigate whether beginning readers 
respond differentially in the graphic domain to the visual memory of words for 
the graphic features of lower- and upper-case script. 

The task consisted of a delayed matching-to-sample procedure. The stimuli 
were five, three-letter and five, four-letter real or legal nonsense words. 

The words were selected by choosing a key word (stimulus), then changing 
the first letter for one response choice, changing the last letter for one 
response choice, and presenting the same word as the remaining choice (for 
example, MAP: MAT, MAP, BAP). 

In order to distribute possible practice effects across the learning trials,
children were randomly assigned to lists and rotated by receiving upper case
and lower case and vice versa over a period of two days. A single "word"
printed in primary type was presented on a Keystone Tac-ette tachistoscope
for a one second exposure. Then an array of three randomly arranged
response words printed in primary type was presented on a 5" x 5" card to the
subjects. There were three practice items for both lists.

The mean numbers of correct responses for each of the two lists (lower and
upper case) of Experiment 1 are presented in Table 1. Since each student
served as his own control, a t test for the significance of the difference
between correlated means for the lower- and upper-case lists was conducted.
The first hypothesis was accepted on the basis of the finding and there
appeared to be a difference in memory of word form, with lower-case letters
being recalled more frequently than upper-case letters, t (19) = 2.39, p < .05.

Experiment 2 

Experiment 2 was designed to investigate whether beginning readers 
respond differentially in the phonological domain to phonologically 
pronounceable words arid phonologicallyxinpronounceable words. 

The 20 children in the sample were taught by a prompting technique the 
sounds of five letters in isolation (hard g, a, b, t, and n). After the children were 
able to produce each sound when the letter was presented to them on a 5" x 
5" card, the test items were administered by the examiner. 

There were 10 items in the task consisting of 3 words, 2 of which were 
unpronounceable. All of the words were composed of 3 of the letters which 
had been previously taught to the children, Throughout the 10 items 1 
pronounceable and 2 unpronounceable variants of the 3 letters were randomly 
arranged on 5" x 5" cards (for example, atb, tba, tab). 

Each student was seated opposite the examiner. S/he was told that s/he 
would be shown three groups of letters and was instructed to point to the one 
s/he could say or pronounce.

Table 1

Means, Standard Deviations, and t-Test Values
for the Variables of Experiments 1-5

Variable                          M      SD     t Value   df    p

Experiment 1
  Upper                           5.15   1.93   2.39      19    p < .05
  Lower                           6.40   2.30

Experiment 2
  Pronounceable                   4.40   2.37   2.08      19    p > .05
  Chance                          3.33   2.37

Experiment 3
  Orthographically Correct Words  4.10   1.83   1.95      19    p > .05
  Chance                          3.33

Experiment 4
  Concrete                        6.10   4.61    .50      19    p > .05
  Abstract                        6.70   4.38

Experiment 5
  Nouns                           2.55   1.57   2.91      19    p < .05
  Verbs                           6.15   5.41

Raw scores consisted of the number of identifications of pronounceable
words. The mean (M = 4.4) which was computed for correct responses is
shown in Table 1. An independent t test was utilized to determine whether the
mean number of correct responses was significantly greater than chance
expectancy (M = 3.33). On the basis of the findings of the analysis, t (19) =
2.08, p > .05, the research hypothesis was not accepted and it was
determined that there was no significant difference between the recognition of
phonologically pronounceable and phonologically unpronounceable
words. Phonology, when measured by reactions to pronounceable and
unpronounceable words, does not appear to be a task related to the
recognition of words by naive beginning readers.
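
The comparison with chance in Experiment 2 can be sketched numerically. The snippet below is only an illustration: it computes the score expected from guessing on 10 three-choice items and a one-sample t statistic from the summary values in Table 1. The function name and the conventional formula t = (M - mu) / (SD / sqrt(n)) are assumptions, and the original analysis was run on raw scores that are not reproduced here, so the result need not match the reported value exactly.

```python
import math


def one_sample_t(mean, sd, n, mu):
    """t statistic for a sample mean tested against a fixed value mu."""
    return (mean - mu) / (sd / math.sqrt(n))


# Guessing on 10 items with 3 choices each yields 10/3 correct on average.
n_items, n_choices = 10, 3
chance = n_items / n_choices

# Summary values for Experiment 2 as given in Table 1 (20 children, df = 19).
t_value = one_sample_t(mean=4.40, sd=2.37, n=20, mu=chance)

print(f"chance expectancy = {chance:.2f}")
print(f"t(19) from summary statistics = {t_value:.2f}")
```

Working from rounded summary statistics gives a value close to, but not identical with, the t reported in the text, which is expected when the raw scores are unavailable.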
Experiment 3

Experiment 3 was designed to investigate whether beginning readers
respond differentially in the orthographic domain to orthographically legal and
orthographically illegal words.

A matching task was devised in which an abstract picture was presented
along with three nonsense words. Two of the nonsense words had illegal
orthography and one had legal orthography (for example, kcagl, ckalg, glack).
There were 10 items with nonsense words of 4, 5, and 6 letters in length with
the legal word being presented in a random position in the list of 3 words. The
words were typed in primary type and presented along with a picture on 5" x
8" index cards.

Each student was seated opposite the examiner and was told to point to the
word that "tells what the picture is." S/he was instructed to guess if s/he was
not sure. After each response the examiner said "okay" or "all right."

The mean number of correct responses (M = 4.1) was tested for
significance by an independent t test to determine whether it was significantly
greater than chance expectancy (M = 3.3). Results of the t test are shown in
Table 1. On the basis of these findings, t (19) = 1.95, p > .05, the research
hypothesis was not accepted and it was ascertained that there was no
significant difference between the recognition of orthographically legal and
illegal words by beginning readers.

Experiment 4

Experiment 4 was designed to investigate whether beginning readers
respond differentially in the semantic domain to concrete and abstract words.

A criterion learning task was devised using six, four-letter words (three 
concrete: calf, park, dust, and three abstract: with, good, luck) selected from 
the Dale List of 769 Easy Words according to the following factors: (a) word 
length, (b) configurational elements, (c) different initial and final letter, (d)
freedom from obvious discrimination confusion, and (e) words not used in the 
beginning basal material in the cooperating school system. 

A flash device was used to produce the learning condition. The device was 
made from a desk telephone number index that allows for the exposure of the
stimuli words in a predetermined random order by sets (concrete or abstract)
to provide a systematic randomization in order to spread learning effect
across trials.

After determining that the students did not know the words (Trial 1), each 
child was individually taught the words by a prompting technique until all of the 
words were known as evidenced by a trial of three correctly pronounced
words. The number of trials to criterion for each set of three words was the raw
score used for the statistical analysis. Task words by sets were learned at
one-day intervals with random assignment to the list to be learned first.

The raw scores used for analysis consisted of the number of trials to 
criterion required by each individual for each of the classes of words (concrete
and abstract). The mean number of trials required by the subjects for the
mastery of each of the lists of words is presented in Table 1. Since each 
subject served as his own control, a t test for the significance between 
correlated means was conducted. Although the mean number of trials for the 
abstract list (M = 6.7) was greater than the mean number of trials for the 
concrete list (M = 6.1), no statistically significant difference was found in the 
analysis between the two groups, t (19) = .50, p > .05. It, therefore, appears 
that concreteness and abstractness are not determiners of word reading 
facility under a testing/teaching condition with naive beginning readers. 
Experiment 5

Experiment 5 was designed to investigate whether beginning readers 
respond differentially in the syntactic domain to nouns and verbs. 

As in Experiment 4, a word learning task was devised in which six common
words (three nouns and three verbs) were selected according to the following
factors: (a) word length, (b) configurational elements, (c) different initial and
final letters, (d) freedom from obvious discrimination confusion, and (e) words 
not used in the beginning basal materials in the cooperating system. 

The desk telephone index (described in Experiment 4) was again utilized to 
produce the learning condition.

The experiment, as in Experiment 4, was a trials-to-criterion learning task. 
The raw scores used for analysis (as in Hypothesis 4) consisted of the number
of trials to criterion required by each individual student for both classes of
words (nouns and verbs). The mean number of trials required by the students
for the mastery of each list of words is presented in Table 1. Hypothesis 5 was
accepted on the basis of the finding of the analysis, in that the students
required significantly more trials to learn the verbs than the nouns, t (19) =
2.91, p < .05. Form class may, therefore, be a determiner of word learning
facility under a perception/learning technique with young learners. 
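
The accept/reject decisions across the five experiments can be checked against a single criterion. The sketch below assumes a two-tailed test at the .05 level (the text does not state the directionality used) and takes the critical value for 19 degrees of freedom, 2.093, from a standard t table. The dictionary labels and function name are illustrative only.

```python
# Two-tailed critical value of t for alpha = .05 and df = 19 (standard t table).
T_CRITICAL = 2.093

# t values as reported in Table 1 for Experiments 1-5.
reported_t = {
    "Experiment 1 (upper vs. lower case)": 2.39,
    "Experiment 2 (pronounceable vs. chance)": 2.08,
    "Experiment 3 (orthographically legal vs. chance)": 1.95,
    "Experiment 4 (concrete vs. abstract)": 0.50,
    "Experiment 5 (nouns vs. verbs)": 2.91,
}


def is_significant(t, critical=T_CRITICAL):
    """True when the absolute t value exceeds the critical value."""
    return abs(t) > critical


results = {name: is_significant(t) for name, t in reported_t.items()}

for name, significant in results.items():
    label = "p < .05" if significant else "p > .05"
    print(f"{name}: t(19) = {reported_t[name]:.2f}, {label}")
```

Under this assumption the pattern agrees with Table 1: only Experiments 1 and 5 exceed the criterion, and Experiment 2 (t = 2.08) falls just short of it.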

Summary 

In regard to the five questions posited for investigation the following
significant results were noted.

Significant differences were found for lower- and upper-case script with
lower-case letters being more easily processed in short term memory than
upper-case letters and for the response alternatives with initial letters being
more salient cues than final letters. A significant difference was also found
between the learning of nouns and verbs with subjects learning nouns more
easily than verbs. No significant differences were found between the
recognition of pronounceable and unpronounceable words and
orthographically legal and illegal words. Also, no significant differences were 
found between the learning of concrete and abstract words and learning 
words with or without pictures. 

It was concluded that the model of word information cues as posited by
Gibson and Levin (1975) does seem to be reflected, at least to some extent, in
the word perception skills of naive beginning readers. In the graphic domain, it
was concluded that configuration as formed by the ascending and descending
letters of lower-case script is a cue for the visual memory of words and that
initial letters are more salient cues than final letters. In the phonological and
orthographic domains, the authors conjectured that even though information
was not extracted by the subjects, the skills to extract phonological and
orthographic information might develop with an increase in age and reading
ability. In the semantic domain, it was concluded that neither concreteness
nor abstractness seems to affect word learning. In the syntactic domain it
was concluded that nouns are more easily learned than verbs. Also, the
addition of picture cues to a word learning task does not appear to influence
learning efficiency.
Children's explanations of word similarities
in relation to word knownness



Researchers have been gathering evidence over the past few years that 
young children beginning to learn to read do not have a very firm grasp on the
meanings of linguistic terms which are used in school instruction; they do not
use metalanguage, or language referring to linguistic units and functions, very
accurately (Downing & Oliver, 1974; Ehri, 1975, 1976, 1977; Holden &
MacGinitie, 1971; Mickish, 1974; Karpova, 1955; Papandropoulou & Sinclair,
1974; Tovey, 1976).

One important metalinguistic concept for beginning readers is that of word.
Is a word to a beginning reader primarily a structural unit of letters and/or
sounds? Or is it a unit which can enter into syntactic arrangements? Or is it
primarily a unit to which people assign meanings, a semantic unit? Do children
respond differently to written words than to oral words? Do they respond
differently depending upon whether or not the word is known to them in written
form? Is it true that, once words are learned in written language, it is possible
to direct children to attend to different aspects of words, such as the
phonology or the orthography of words, the syntactic arrangements words can
enter into, or the semantic functions words can play (Gibson, 1971; Mason,
1976a, 1976b)? A number of researchers (Golinkoff & Rosinski, 1976; Rosinski,
Golinkoff, & Kukish, 1976; Wickens, 1970, 1972) have contended that, while
children can be directed to attend to phonological and syntactic aspects of
words, semantic aspects take precedence in the child's developing concept of
word.

The present study investigated the child's development of the concept that 
a word is a unit used to express meanings (it also investigated the
metalanguage used by the children during the experimental task). Pilot studies
had indicated that younger children tend to think of words in written language
primarily as structural units of letters and/or sounds and that older children
think of words in written language primarily as semantic units.

In the pilot studies, however, and in previous studies into the development 
of semantic notions (Anglin, 1970; Naron, in press), the same words were used 
for all subjects. The pilot studies, design and measurement theory (Clark,
1973), developmental theory (Brown, 1975), and developmental research
(Richman, Nida, & Pittman, 1976) indicated that the sample of linguistic units
should be as carefully selected as the sample of human subjects; the sample
of linguistic units (here, of words) should take into account the subject's prior
knowledge of the units. For these reasons, it was considered necessary to 
identify and use words which the children in the study rated as known and 
unknown in written language. 

* The author is indebted to the following people for their help in making this
study possible: Thomas Estes, [illegible], Edmund Henderson, & Herbert
Richards, all of the University of Virginia, and the administration, faculty, and
children of Sussex County, Virginia.






Method

Subjects

Thirty children each from grades one, two, four, and six were chosen at 
random from an elementary school in a rural Virginia county. In this school 
system no one method of teaching reading was used to the exclusion of other 
methods and children were placed in classrooms using chronological age as 
the primary criterion under a continuous progress model. Achievement and 
ability scores in the county averaged below national norms; however, the
range of scores was equivalent to the national range. The racial breakdown of
students (75% black and 25% white) was within 5% of the population totals 
for the county. 



Procedure and Design

Each child took part in two tasks: a Knownness Rating in which his or her
words were selected and a Word Choice task in which the child explained how
chosen words went together for him or her. The words for the Knownness
Rating came from word lists of nouns chosen at random from four sources:
Harris & Jacobson core words for first grade (Harris & Jacobson, 1973);
Sheldon basic reading series (1968); Clarence R. Stone's revision of the Dale
list of 769 easy words (Spache, 19--); and The teacher's word book of 30,000
words (Thorndike & Lorge, 1944). The American heritage dictionary of the
English language (1975) was used to determine if the most common usage of
the randomly selected word was indeed noun. Words were arranged in lists
according to frequency of occurrence to aid the children in selection. From
the lists each child rated 24 words as known and 24 words as unknown to
him/her in written form. The Knownness Ratings were conducted a day prior to
the Word Choice task.

In the Knownness Rating examiners scored a word Known (K) if the child
pronounced the word correctly (within the child's speech patterns) and if the
child gave a gist of meaning for it appropriate to the community. The 48 words
from the Knownness Rating were cast into four knownness configurations for
the Word Choice task. The four knownness configurations were used to study
the effects of the child's knowing all of the words (K-KK); knowing none of the
words (U-UU); knowing either his/her word, the question word (K-UU); or
knowing only the examiner's words, the choice words (U-KK), in an X-AB
paradigm. The four knownness configurations (K-KK, K-UU, U-KK, U-UU) and
the two presentation modes (written and oral) were counterbalanced and
assigned to subjects at random in the Word Choice task.
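
The crossing of knownness configurations with presentation modes can be enumerated as a small sketch. The variable names are illustrative, and the counts of known words are read off the configuration labels themselves (one question word, two choice words); nothing here reflects the authors' actual counterbalancing scheme, which the text does not describe in detail.

```python
from itertools import product

# Labels as given in the text: question word, then the two choice words.
configurations = ["K-KK", "K-UU", "U-KK", "U-UU"]
modes = ["written", "oral"]

# Every configuration paired with every presentation mode.
conditions = list(product(configurations, modes))

# Number of known (K) words among the three words of each triad.
known_per_triad = {label: label.count("K") for label in configurations}

for configuration, mode in conditions:
    print(f"{configuration:5s} {mode:8s} known words: {known_per_triad[configuration]}")
```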

The dependent measure was the children's explanations of choices in the
Word Choice task. Children were asked to explain: "How does your word [the
question word] go with my word [the choice word] for you?" Prior to the
experimental task, the examiner used a sample triad, eliciting or modeling
answers that were both semantic and structural in nature, high level as well as
trivial. Responses and probes were recorded on individual protocols. Content
analysis indicated that the intention of the responses could be reliably scored
on a 3-point scale (1 = structural responses; 2 = unclassifiable responses; 3
= semantic responses).

Independent variables were grade level, knownness of the question word,
knownness of the choice word, and mode of presentation. Protocols were
used for descriptive analysis of metalanguage, as well as for the quantitative
analyses.



Results 



In a four-way repeated measures analysis of variance children were found
to give significantly more semantic responses in three conditions: 1) if the
question word was known as opposed to unknown, F (1, 119) = 36.48,
p < .0[illegible], means 9.0 and 8.4; 2) if the choice word was known,
F (1, 119) = [illegible], means [illegible]; and 3) if the word was oral rather
than written, F (1, 119) = [illegible], p < .01, means [illegible]. Contrary to
the results of the pilot studies, in which knownness was not controlled, the
main effect for grade level was not significant. Interactions were significant
for knownness [illegible] and mode of presentation, [illegible] = 3.67,
p < .01. Two-way analyses of variance treating each knownness configuration
separately again revealed no main effects for grade. Significant main effects
were [illegible] presentation in each of the [illegible] = 2.68,
p < [illegible]. Examination of the interactions indicated that the youngest
[illegible] words in written language semantically as [illegible] induced to
explain the [illegible] responses than when giving structural responses.
Discussion



In view of previous research it is interesting that the effect of grade level 
was non-significant. The absence of a main effect for grade when we know 
children learn more words and word meanings as they grow older underlines 
the issue of whether there is a single semantic concept corresponding to 
"word" that would be manifested as a straight developmental trend or
whether the notion of word for children depends upon the mode of
presentation or other features such as knownness, as indicated by these
results. In controlling for word knownness, this study fits with other current
research and theory which throw doubt on learner characteristics previously
believed to increase (as a main effect) with age (Richman, Nida, & Pittman,
1976; Brown, 1975). The study needs to be replicated and also needs to
include more sensitive controls over word knownness, extending to both the
oral and written presentation modes.

The descriptive analysis indicated that sixth grade children (and, to some
extent, fourth grade children) were able to control strategies of projecting
meaning onto unknown words and of creating hypothetical contexts in which
unknown words as well as known words would "make sense," using the
child's intention as the measure. Examinations of the interactions for written
and oral presentation modes with grade level indicate that the means come
together at sixth grade. Replication of the study should include an extension
upward into adolescence to see if main effects for grade level should appear
from sixth grade upward. Indeed it might be predicted that the results for
written and oral modes would be the same for all configurations after sixth
grade as they are for K-KK in first through sixth grades. It can be argued that
with the known words (K-KK), the child does not have to invoke any meaning
creation strategies, whereas, in all three of the other configurations (K-UU,
U-KK, U-UU), s/he must.

The descriptive data raise the question of whether children who are reading 
in school tasks would not benefit from greater understanding of and use of 
metalanguage. Given the state of research in reading methodology, we have
little evidence now. Linnea Ehri's contention (1976) that it is unnecessary to
teach metalanguage as a prerequisite to reading seems reasonable; children
in this study who had learned to read successfully were learning about the
referents of metalanguage during and not just prior to their contacts with
written language. The only immediate implication that could be drawn from
this study for teaching is that teachers should try to learn what their students,
individually, understand about and mean by the metalinguistic terms used in
school tasks.
Word prediction of good and poor readers



The role of semantic and syntactic cues in fluent reading has been the basis 
for much recent controversy. While some have suggested that effective use of
the contextual information provided by these cues is the determining factor in
differentiating good and poor readers (Goodman & Goodman, 1977; Smith,
1976), others have argued that virtually all readers, regardless of achievement,
employ semantic and syntactic cues and other factors must account for
achievement differences (Weber, 1971; Kolers, 1975; Allington & Strange,
1977; Allington, 1978). In each of these latter studies the use of visual
information, or an interaction in the use of visual and contextual information,
seemed to differentiate good and poor readers. Weber (1971), for instance,
noted that the majority of errors by all readers conformed to preceding
contextual constraints but that good readers seemed to produce errors which
more closely approximated visual characteristics of the target word. Similarly,
Kolers (1975) found good readers' recognition memory for visual features of
sentences exceeded that of poor readers. Allington and Strange (1977) found
poor readers ignored visual anomalies in text and gave instead a response
which fit syntactic and semantic constraints more often than did good readers
who seemed more constrained by the visual information. Finally, Allington
(1978) demonstrated that poor readers' recognition accuracy suffered more
than good readers' when syntactic information was eliminated.

However, use of context does not seem to be an either/or situation. Mason 
(1977) has demonstrated the interdependence of various types of processing 
while reading, a result which supports recent interactive processing models of
reading (Rumelhart, 1975; Mosenthal, Walmsley, & Allington, 1978). A simplified
instructional strategy for inducing such interactive processing has been
recently proposed by Dahl and Samuels (1977) and is called 'hypothesis-test'
training. Here readers are taught to utilize both semantic-syntactic and
grapho-phonic information integratively. Instruction of this type produced a
higher level of reading achievement than other more traditional methods
(Samuels, Archwamety, & Dahl, 1974).

However, we still know little about how readers come to develop a sensitive
strategy for employing these information sources. Goodman (1965) and
Biemiller (1970) have demonstrated that use of contextual information
develops with reading achievement and Pearson and Studt (1975) have
demonstrated the positive effects of contextual richness and word frequency
upon the word prediction behaviors of readers. There is a particular need to
clarify the utilization of semantic-syntactic cues by poor readers. The
question, then, for the present study is whether good and poor readers of the
same age level differ in performance on a task which requires the integration
of semantic-syntactic and grapho-phonic information. Additionally, the
performance of older poor readers will be compared to that of younger good
readers. These comparisons should also provide an opportunity to test a 
recent proposal that poor readers have no specific skills deficit but rather 
perform very much like younger good readers (Guthrie, 1973). 



Method

Students were drawn from three [illegible] were considered good readers while
subjects scoring one year [illegible] below grade level were considered poor
readers. From the pools of students [illegible] program effects (Barr, 1977).

The experimental materials were those employed by Pearson and Studt
[illegible].

Word Pair
  HF: Stop
  LF: Halt

Context Level   Sentence
Poor            We decided to ____ for a while
Moderate        The men were ordered to ____
Rich            You had better decide to ____ for that red light

[illegible] for one sentence and a low-frequency target word for the other.

Results and Discussion

Cell means and standard deviations are reported in Table 1.
Table 1
Cell Means and Standard Deviations: Percentage of Words Needed for Identification

                      High Frequency                        Low Frequency
            Rich        Moderate    Poor         Rich        Moderate    Poor
Students    M    (SD)   M    (SD)   M    (SD)    M    (SD)   M    (SD)   M    (SD)

4G           7.2 (1.7)  39.8 (3.2)  49.4 (3.3)   59.9 (2.9)  65.8 (2.8)  80.4 (2.4)
4P          31.2 (3.4)  55.3 (3.9)  69.4 (2.8)   74.9 (2.6)  85.8 (1.7)  95.2 (1.2)
2G          31.7 (3.8)  49.6 (3.5)  69.3 (2.8)   71.0 (3.3)  69.2 (3.3)  88.9 (2.3)



There were significant main effects for each of the factors under
consideration. The 4th grade good readers needed 50.45% of the word to
achieve recognition, 4th grade poor readers needed 68.66%, and 2nd grade
subjects needed 63.31%. These differences were significant, F(2, 57) = 11.70,
p < .001. The mean for high frequency words was 44.79%; for low frequency
words, 76.82%. This difference was significant, F(1, 57) = 105.18, p < .001.
Unlike the Pearson and Studt study, there was no interaction between
frequency and group. This would indicate that the effect for frequency was
equally distributed over all groups.

There was a significant effect for context, F(2, 114) = 24.53, p < .001. The
mean proportion of word necessary to achieve identification for rich context
sentences was 46.0%; for moderate context, 60.9%; for poor context, 75.5%.
The effect of context differs across the word frequency levels as indicated by
the significant interaction between these variables, F(2, 114) = 4.84, p < .01.
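
The group, frequency, and context means quoted above can be approximated by averaging the cell means in Table 1. The sketch below is only a cross-check under the assumption of equal cell sizes; the dictionary layout and variable names are illustrative, and small discrepancies from the reported figures are expected because the table entries are already rounded.

```python
from statistics import mean

# Cell means (percentage of the word needed) copied from Table 1.
# Within each frequency level the order is rich, moderate, poor context.
cells = {
    "4G": {"HF": [7.2, 39.8, 49.4], "LF": [59.9, 65.8, 80.4]},
    "4P": {"HF": [31.2, 55.3, 69.4], "LF": [74.9, 85.8, 95.2]},
    "2G": {"HF": [31.7, 49.6, 69.3], "LF": [71.0, 69.2, 88.9]},
}
contexts = ("rich", "moderate", "poor")

# Marginal mean for each group of readers (all six cells).
group_means = {g: mean(c["HF"] + c["LF"]) for g, c in cells.items()}

# Marginal mean for each frequency level (nine cells each).
freq_means = {
    f: mean([v for c in cells.values() for v in c[f]]) for f in ("HF", "LF")
}

# Marginal mean for each context level (six cells each).
context_means = {
    name: mean([c[f][i] for c in cells.values() for f in ("HF", "LF")])
    for i, name in enumerate(contexts)
}

for label, table in (("group", group_means), ("frequency", freq_means),
                     ("context", context_means)):
    for key, value in table.items():
        print(f"{label:9s} {key:8s} {value:6.2f}")
```

Each recomputed mean falls within about a tenth of a percentage point of the value given in the text.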

These results, with the exception of the lack of a group x frequency
interaction, are similar to the Pearson and Studt (1975) results and their
discussion is equally relevant for this study. The fact that 4th grade good
readers were able to identify the target words with less graphic information
lends credence to their conclusion that the ability to use context is a function
of reading proficiency. The context x frequency interaction supports their 
conclusion concerning response availability. A richer context was more 
helpful when the target word was a high frequency word. When the target was 
a low frequency word much more graphic information was needed to achieve 
recognition. We also found many instances of students supplying the high 
frequency synonym for low frequency target words. 

The second purpose of this study was to determine if good and poor readers
employed different strategies to identify unknown words. In order to answer
this question the incorrect responses were analyzed to infer which cue
systems were being used to arrive at a response. In the absence of graphic
cues the most frequent choice of all groups was to give a response that was
semantically and syntactically appropriate (M = 81.0%). Second grade students
gave such a response 75.7% of the time, 4th grade poor readers gave such
responses 75.9% of the time, and 4th grade good readers gave such
responses 91.5% of the time.

Once graphic cues became available the response pattern changed. The
most popular strategy was to give no response (M = 51.0%), with this occurring
in second grade students 49.8% of the time, in 4th grade poor readers 51.7%
of the time, and 4th grade good readers, 51.6%. The second most popular
strategy was to give a response that was graphically, semantically, and
syntactically appropriate (M = 27.3%). Second grade students gave such
responses 26.9% of the time, 4th grade poor readers 23.7%, and 4th grade
good readers 31.2%. In each of these analyses the older poor readers and the
younger readers performed similarly, responding less frequently to semantic 
and syntactic constraints than the older good readers. 

Conclusions 

The response analysis indicates that all students were able to utilize
graphic, syntactic, and semantic cues. Good and poor readers do, however,
seem to differ on the integration of these cue systems. This would support a
notion of the reading process that includes an increase in the ability to
integrate the cue systems as a function of an increase in reading fluency
(Pearson & Studt, 1975). In this respect this study also supports Guthrie's
(1973) proposal that older poor readers perform like younger good readers. In
the absence of visual information (Trial 1) students in all groups were likely to
supply meaningful responses, that is, responses that were syntactically and
semantically appropriate. Once graphic information was introduced (Trials
2-4) each group seemed to be affected in somewhat the same fashion. Close
to 50% of the students in each group chose a no-response strategy. This
would seem to further support Pearson and Studt's (1975) conclusion
concerning response availability. It seems that the students were able to
achieve an acceptable meaning for the target word but the fact that they
lacked a word that fit all the requirements (graphic, syntactic, and semantic)
inhibited their ability to respond. This conclusion is confounded since the
students knew their first response was in some way inappropriate. If a
response was given, most incorporated the graphic information. In fact, it
would seem that graphic acceptability became the most salient cue in
selecting a response, with the better readers more likely to produce
a response that agreed not only with this information but also with each of the
other available cues. However, as noted earlier, the introduction of graphic
constraints inhibited responses for the less skilled readers.

In summary, then, the older good and poor readers did differ on their ability
with the experimental task but there seem to be no differences in the
performances of the older poor readers and younger good readers. All groups
were affected by contextual richness and word frequency but in some cases
to different degrees. Finally, more skilled readers seemed to be able to use 
graphic information in conjunction with contextual constraints more efficiently 
and effectively than the less skilled readers. 
The acquisition of knowledge from text

A progress report on prose research at the Cornell University
reading research group



Research on the acquisition of information from texts through the process of
reading is of obvious theoretical and practical importance. If we could specify
how the structure of the information in a text passage affects the structure of
the knowledge acquired from that text then we would not only be able to say a
great deal about cognitive functioning but we would be able to make a number
of statements of great practical value as well. Among other practical statements
we might be able to specify how a text should be organized to enhance
the information acquisition process or to characterize questions to appropriately
probe the knowledge acquired by the reader.

Clearly the motivation for prose research of this type is very high. This
is fortunate, for research in this area is frustrated by the sheer enormity of
the research domain, not to mention the complexity of the issues involved.
Linguists, for example, have taken a long time to get around to solving
problems of semantics, and for good reason. The study of meaning, and the
acquisition of meaning, is a very difficult task. To make matters worse, there
is even a lack of good tools for use in this kind of prose-related research.

In this paper I would like to be able to report that we have solved a lot
of these problems in a series of brilliant studies. Unfortunately this is not the
case. During the past two years, however, we have made considerable
progress in the development of research tools and we have used these in
several studies on which I will briefly report later.

An Information Processing Model 

To begin, let's conceptualize the knowledge acquisition process as a problem 
in data communications, as shown in Figure 1. First, an author 
searches his base of data about the world and selects from it a set of data 
items which is to comprise the content of his written message. At this first 
stage the author must make various pragmatic decisions about what information 
to include in the content structure, based on considerations such as the expected 
state of knowledge of a potential reader and the inferential capabilities 
of that reader. The second stage in this process model is the organization 
of the selected data or semantic content into a coherent text base from 
which a natural language representation may be generated. At this stage the 
information must be ordered into a reasonable sequence and 
marked for discourse features such as topicalization and focus. Finally, linguistic 
processes must operate on this text base to produce the actual natural language 
strings which constitute the written text. 

The reader, presented with a text, must reverse this generation process. 
He must decode or parse and semantically interpret the surface form of the 

McConkie 
Figure 1. An information processing model of text generation and text 
decoding. 

text's natural language representation and generate a text base of his own. 
From this he then must apply any relevant inferential processes to arrive 
at the content structure which he concludes must have been the source of 
the author's written text. 

Note that the model here is not necessarily a psychologically real one but is 
rather an attempt to identify the kind of processing which semantic information 
must undergo to conform to something that we readily know. We can 
only directly observe one of these stages, the text itself. The rest we must 
infer. But since people can freely generate texts on all kinds of topics in 
infinite variety, we must postulate a process of selecting from the writer's 
knowledge what he wants to say, or the message base. And since the same 
set of information can be expressed in many different ways, in different styles 
or with different focus, we must hypothesize a text base, containing such detail 
about how the text will be constructed. Below I will mention a study done in our 
group which gives evidence for the effect of staging operations and thus for 
the existence of a text base stage. 

The Representation of the Content Structure Level 

There is an obvious need in prose-related experimental research to describe 
and characterize the stimulus, the text passage to be read. Also, since in our 
studies we have used mostly free recall protocols, that is, having the subject 
write down everything he can remember from a passage, we also need ways of 
characterizing those free recalls, so we can identify just which aspects of the 
original passage have been retained. Clearly these descriptions must be done 
in a non-arbitrary manner and in such a way as to avoid the extreme 
variability which is evident in the surface structure language of any of these texts. 
This method of description is one of the research tools which has been 
mostly missing but sorely needed for research of this type. Consequently we 
have expended and are continuing to expend a considerable amount of energy 
on the development of a consistent and reliable method of representing the 
semantic information of a text at the content structure level, the first level 
seemingly independent of linguistic processing. 

Some time ago we began using a propositional network for representing 
information at this level. In a network notational system of this type, the 
content or meaning of a text is divided up into underlying semantic units, 
primarily states and events, and each unit is represented by a set of nodes 
and arcs connecting the nodes. Each node contains a semantic concept while 
the arcs are labelled to represent the pair-wise relationships which exist between 
the various concepts contained in the nodes. In each case, be it an event or a 
state, the resultant set of nodes and arcs is called a proposition. 

As a very simple example of this kind of representation consider the 
following two propositions given as an example in McConkie (1977). 
Proposition 1 (a state):   (:BALL)-ATT->('RED) 
Proposition 2 (an event):  (:JOHN)-AGT->('HIT)-OBJ->(:BALL) 

ATT = attribute 
AGT = agent 
OBJ = object 

As McConkie points out, these two propositions constitute a content structure 
which could be said to represent the meaning of a number of different text 
strings such as 

John hit the red ball. 
The red ball was hit by John. 
The ball hit by John was red. 
The ball which John hit was red. 
It was a red ball which John hit. 
The red ball which John hit... 

Thus the network represents certain semantic relations which can be expressed 
in many different ways, and ignores many aspects of variability in the form in 
which that information is expressed. 
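
The two propositions can also be written down as a small data structure. The following is only a minimal sketch in Python, not the notation or software used in the studies reported here; the names `Arc`, `CONTENT_STRUCTURE`, `concepts`, and `shared_concepts` are assumptions made for illustration. Each proposition is stored as a set of labelled arcs between concept nodes, and the concept shared by the two propositions is what ties the state to the event.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Arc:
    """One labelled, directed arc between two concept nodes (hypothetical type)."""
    source: str
    label: str
    target: str


# Each proposition is the set of arcs that make up one state or event.
CONTENT_STRUCTURE = {
    "P1": frozenset({Arc(":BALL", "ATT", "'RED")}),      # a state
    "P2": frozenset({Arc(":JOHN", "AGT", "'HIT"),
                     Arc("'HIT", "OBJ", ":BALL")}),      # an event
}


def concepts(arcs):
    """Return the concept nodes touched by a collection of arcs."""
    return {node for arc in arcs for node in (arc.source, arc.target)}


def shared_concepts(structure, first, second):
    """Return the concepts that link two propositions in the structure."""
    return concepts(structure[first]) & concepts(structure[second])


if __name__ == "__main__":
    print(shared_concepts(CONTENT_STRUCTURE, "P1", "P2"))
```

Nothing in this structure records word order or voice, which is why all of the surface strings listed above would map onto the same two propositions.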

The propositional notation system which we currently use is a somewhat 
simplified version of the one described in Frederiksen (1975). Frederiksen's 
system was chosen over other representational systems for several reasons. 
First, his system is highly detailed. By incorporating a large number of fine 
distinctions in its taxonomy of concepts and in the classification of the relations 
which can co-occur with these various concept classes, his system is potentially 
capable of representing a broad range of semantic information. Secondly, since 
a network representation has a natural implementation in a computer as a list 
structure, Frederiksen's notation is suitable for use as the data structure in a 
computer-based data handling system. This is an important consideration since 
one of the things that makes prose research difficult is the great ease with 
which one can become inundated with data. Some kind of automation of the 
data handling requirements is really mandatory. In fact, we have developed an 
interactive lab computer software system, called PROSYS, for facilitating the 
input of such networks and for operating on them for data handling and 
analysis functions. 

Finally, Frederiksen's system is suitable for hand-coding. I use the term 
'hand-coding' here to point out the difference between the way we arrive at the 
content structure of a text and the approach taken by a number of workers in 
the field of artificial intelligence. In our case, human scorers accomplish 
the analysis of a text into a propositional network which represents the content 
structure of that text. In the artificial intelligence case a computer program 
parses an input text and renders it into a semantic representation. In a real 
sense the output of the two different approaches is the same - a semantic 
representation at something like what I have been calling the content structure 
level. 

If the two approaches output essentially the same thing, then why not 
employ an automatic parsing approach? The reason is quite simple - as yet such 
computer text understanding systems are not as good as human scorers at the 
kind of semantic analysis that is required for psychological research. The problem 
with automatic parsing is that many natural language text strings present 
enormous difficulties for machine parsers while human scorers can easily 
understand and represent their meanings. Consider the following example from 
Charniak and Wilks (1976). Suppose a restaurant patron makes the following 
statement: "Waiter, I would like spaghetti with meat sauce and wine." 
The patron would be surprised to get spaghetti covered with a mixture of 
meat sauce and wine. If the order had been 'with garlic and butter', however, 
a mixture of the two is exactly what would have been expected. Charniak and 
Wilks point out that this has been considered to be a linguistic problem, one of 
structural ambiguity of the underlying syntax of the 'with' phrase. The meaning 
of this phrase, however, is disambiguated on other than syntactic grounds. 
An inference must be made based on one's knowledge of food. This kind of 
disambiguation process is simple for people, and a scorer would have no 
difficulty in representing the meaning of this sample sentence. A machine 
parser, on the other hand, can have problems with ambiguities of this type. 

Of course there are problems with hand-coding as well. One of the most 
significant is the problem of achieving good inter-scorer reliability. Unreliability 
comes from two sources: lack of agreement on the meaning to be ascribed to 
a specific piece of text and the use of different representations for the same 
meaning. We have been working to reduce this latter source of unreliability 
by developing explicit and detailed descriptions of the elements and relations 
available in our modified propositional notation system and by producing 
a set of procedures for using these. Pearson (1977) reports on the reliability of 
using a propositional network representation in a slightly different case, 
namely in a situation where the network representation of a stimulus text has 
been determined and is to be used for the scoring of free recall protocols. The 
reported results indicate that the degree of reliability varies with respect to 
the level of analysis. When considering the amount of information recalled 
in the protocol, that is, the number of elements in the stimulus text representation 
found to be present in the recall, the reliability was extremely high (r = .99). 
When considering the type of information recalled, the reliability varies with 
the level of representation, that is, with the number of categories (for 
propositions r = .84, for elements r = .78). It is interesting to note that in this 
reliability study the largest amount of variability, some 17%, was due to 
scorer errors or omissions and that only 7% was attributable to notation-related 
difficulties. 
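
For readers unfamiliar with how a reliability coefficient of this kind is obtained, the following is a minimal sketch in Python. It assumes that reliability is the product-moment correlation between two scorers' counts of recalled elements per protocol; the reliability study itself may have used a different procedure, and the function name `pearson_r` and the sample scores are hypothetical.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and of squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical counts of elements credited to six protocols by two scorers.
    scorer_a = [41, 28, 35, 52, 19, 44]
    scorer_b = [40, 30, 35, 50, 21, 45]
    print(round(pearson_r(scorer_a, scorer_b), 3))
```

A coefficient computed this way reflects only whether the two scorers rank and space the protocols alike; it does not show whether they credited the same elements, which is why the type-of-information analyses are reported separately.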

Some Examples of Research 

In the information processing model (Fig. 1) discussed above I have 
depicted staging as an operation on the message base which defines the 
relative prominence given to various segments of the semantic content. There 
are a number of devices writers use to vary the staging level or degree of 
prominence given to different information in the text. In one study from our 
laboratory Paul Clements (1976) investigated what effect some of these have on 
the reader, that is, what is the role of staging in the reception of information. 
In these experiments two groups of students were given pairs of passages which 
differed with respect to the level (high prominence vs. low prominence) at 
which specific chunks of information were staged. The hypothesis was that 
specific pieces of information would be better recalled after reading a passage in 
which that information is staged high than after reading a passage in which 
the same information was staged lower. The experimental results strongly 
supported this prediction and further suggest that staging primarily affects the 
acquisition of information during reading itself rather than having its effects only 
later during retrieval. As part of the experimental procedure, Clements had 
his subjects rate pieces of information in the passage for their relative 
importance. The results showed that subjects' ratings did not correlate with staging 
differences. Thus even though information staged higher was not recognized as 
being more important to the passage, still the readers were more likely to 
recall it. 

In another study, Lucas (1977) investigated whether the type of test format 
that readers anticipate influences what they retain from reading a passage. 
Subjects were asked to read a series of two passages. After the first passage 
one group was given a number of test questions and another group was 
asked to write down everything they could remember - a free recall test. 
Subjects expected the same type of test on the second passage, but instead 
both groups were given a free recall test. The experimental results showed that 
the type of test a group expected did have an effect on what they retained 
from the passage. The most obvious effect was that subjects who were given 
a free recall test after the first passage read the second passage an average 
of 28% slower than those who had been led to anticipate the question test, 
although they did not recall significantly more propositions. Lucas notes that 

A difference in reading time would seem to indicate a difference 
in the processing activities of the readers, which should result in 
differences in what is retained from the passage. While it is clear 
that characteristics of the passage itself had the greatest influence on 
what was recalled, the reading strategies did have some effect, as 
revealed by the results of the analyses on patterns of recall 
information. The fact that the total number of propositions recalled 
did not differ significantly between the experimental groups suggests 
that had the recalls been analyzed using "idea units" or other 
less fine-grained techniques, the conclusions drawn might well 
have been "no difference." 

Data from this further analysis led Lucas to conclude that anticipation of a 
free recall test may influence the reader to acquire more central aspects of a 
text and to be more aware of the manner in which the information was 
presented in the text. 
The final study which I will briefly mention is one by Scott Smith (1977) 
which investigated memory over varying intervals of time. Subjects were given 
a passage to read and immediately produced free recalls. After three weeks 
some of the subjects were unexpectedly tested again and another group was 
tested after three months. Smith's most significant finding was that 

First, the recalled information most similar to the passage is much 
more stable over time than recalled information modified from the 
passage. Second, there is a substantial amount of information present 
in delayed recalls that was not present in an immediate recall. 
Thus, the immediate recall must not provide a complete indication of 
the information which the subject has retained from the passage. 

In his analysis, Smith was able to see which concepts and propositions, at 
the lowest level of analysis, were retained over different retention intervals. 
However, none of the studies mentioned have made attempts at comparing the 
structure of the content of the passages read with that of the recall protocol. 

Summary and Promise for the Future 

What then do we know from all of this? First, we conclude that in order to 
answer many of the detailed prose-related questions, we need to have the 
kind of rich experimental data which we get from using a detailed propositional 
network representation of semantic content structure. These are questions like 

-what types of influence does the passage structure have on what 
readers tend to remember from it? 

-what are the structural characteristics of subjects' cognitive representations 
of information from the passage, and how do text and task variables 
influence this? 

-in what ways do passage and cognitive structure influence the retrieval 
and use of information obtained from passages? 

Second, we know we need to further specify procedures for analyzing 
semantic content structures and third, we need some computer tools to help us 
pull structural information from the huge data matrices which result from the 
translation of semantic networks into their equivalent numerical form. These, 
of course, are primarily methodological 'results'. We believe that the analysis 
techniques and computer programs which we are developing will be important 
contributions to this type of research. In addition, I have mentioned some 
substantive things which have been learned, the most important of which is the 
work by Paul Clements on staging. 
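
One way to picture the translation of a semantic network into an equivalent numerical form is as a node-by-node matrix whose cells hold integer codes for the arc labels. The Python sketch below is an illustrative assumption only, not the representation used in PROSYS; the function name `to_matrix` and the coding scheme (0 for no arc, labels numbered alphabetically from 1) are hypothetical.

```python
def to_matrix(arcs):
    """Turn (source, label, target) arcs into a coded adjacency matrix.

    Returns the ordered node list, the label-to-code mapping, and a square
    matrix in which cell [i][j] holds the code of the arc from node i to
    node j, or 0 when there is no such arc.
    """
    nodes = sorted({node for src, _, tgt in arcs for node in (src, tgt)})
    labels = sorted({label for _, label, _ in arcs})
    index = {node: i for i, node in enumerate(nodes)}
    code = {label: i + 1 for i, label in enumerate(labels)}
    matrix = [[0] * len(nodes) for _ in nodes]
    for src, label, tgt in arcs:
        matrix[index[src]][index[tgt]] = code[label]
    return nodes, code, matrix


# The two example propositions from McConkie (1977), written as arcs.
ARCS = [
    (":BALL", "ATT", "'RED"),
    (":JOHN", "AGT", "'HIT"),
    ("'HIT", "OBJ", ":BALL"),
]

if __name__ == "__main__":
    nodes, code, matrix = to_matrix(ARCS)
    print(nodes)
    print(code)
    for row in matrix:
        print(row)
```

Even this three-arc example yields a 4 x 4 matrix that is mostly zeros, which suggests how quickly such matrices grow for a full passage and its recall protocols.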

The promise for the future is for more small steps which hopefully will 
chip away at the big problem. Recently we have been making good progress 
in the third methodological area I just mentioned - the task of pulling structure 
out of recall semantic content structures. With good computer tools for this, 
there is promise of additional experimental results which will hopefully move us 
closer to answers to the kinds of theoretical and practical questions with which 
I started. 

References 

Charniak, E., & Wilks, Y. Computational semantics. New York: North-Holland, 1976. 
Clements, P. The effects of staging on recall from prose. Dissertation, Cornell 
University, 1975. 
Frederiksen, C. H. Representing logical and semantic structure of knowledge acquired from 
discourse. [...] 
[...] American Educational Research Association, New York, April 1977. 
Smith, S. [...] varying intervals of time. Paper presented at the annual meeting of the 
American Educational Research Association, New York, April 1977. 







CONNIE A. BRIDGE 
University of Kentucky 

ROBERT J. TIERNEY 
University of Arizona 

MARY JANE CERA 
Kino Learning Center 

Inferential operations of children 
involved in discourse processing 

The factors involved in the comprehension and memory of text have 
recently become the focus of studies in verbal learning. Basic to the study of 
text comprehension has been the development of detailed models of 
discourse structure, which are seen as a representation of the knowledge 
structure of the writer or speaker producing the text. The ability to analyze a 
text into specifically defined and ordered units has provided networks against 
which the subject's memory for text may be compared. This comparison of the 
subject's knowledge structure acquired by reading the text to the knowledge 
structure underlying the generation of the text itself has provided insight into 
the characteristics of discourse as well as the interaction between textual 
characteristics and discourse processing operations. 

One aspect of discourse processing that becomes obvious when the 
network of the subject's recall is compared to the network of the original text 
is that when subjects read and recall text, they acquire some information 
which was not explicitly stated in the text itself. This generated or inferred 
information can be investigated by comparing the propositions in the subject's 
recall which were not explicitly stated in the text to those propositions 
actually stated in the text. 

An examination of the inferences and inferential processes involved in 
generating inferred propositions from stated ones provides supportive 
evidence for constructivist theories of language processing. The 
constructivists contend that the comprehension-memory system does not 
process each input sentence in its entirety but, rather, selectively processes 
the input, using information selected from the input sentence and stored 
knowledge about the world to generate a semantic interpretation based upon 
the input data in the text (Frederiksen, 1975b). 

The definition of inference used in this study was relatively unrestricted in 
order that it might reflect the pervasive role of inference in language 
processing. As stated by Frederiksen (1977b), "Inference occurs whenever a 
person operates on semantic information, i.e., on concepts, propositional 
structures or components of propositions, to generate new semantic 
information, i.e., new concepts or propositional structures. Any semantic 
knowledge which is so generated is inferred" (p. 7). Thus, any information 
generated by the reader that was not explicitly stated in the text was classified 
as an inference. 

The purposes of this study were to find out how much explicitly stated 
information subjects remembered from reading a passage, how much inferred 
information they generated during that process, how many total units of 
information were recalled, and the relative proportion of inferred information 
to total information in their recalls. Furthermore, the study was designed to 
1. Good readers recalled significantly more units of explicit information, 
[...] 2.87, p < .01, and had longer total recalls, t(34) = [illegible]. 
Table 1 

Mean number of explicit, inferred, and total 
items of information in the recalls of good and poor readers 

Types of Information         M          SD 

Explicit Items 
   Good                     91.89      25.46 
   Poor                     48.17      [illegible] 

Inferred Items 
   Good                     85.61      51.17 
   Poor                     44.78      31.87 

Total Items 
   Good                    177.50      60.36 
   Poor                     92.94      45.61 




2. Within each group and for each type of information (explicit, inferred, 
and total), there was a wide range between the subject with the highest 
number of units recalled and the one with the lowest. 

3. There was considerable overlap in the scores, with some of the poor 
readers recalling more units of information than some of the good readers. 

4. In both groups there were individual subjects who had high explicit 
scores but low inferred scores. The reverse situation also occurred, but less 
frequently. 

5. The individual who had the highest explicit recall score remembered 
only half of the total possible concepts and relations in the passage. 

6. There was no significant difference in the proportion of inferred 
information to total information in the recalls of good readers compared to 
poor readers. 

7. Good readers made significantly more causal and conditional 
inferences, both of which involved dependency operations, t(34) = 3.66, p < 
.01 (see Table 2). 

8. Poor readers made significantly more superordinate inferences in 
which a more general concept was substituted for a specific one. These 
overgeneralizations involved macrostructure operations, t(34) = -3.45, p < 
.01. 

9. There was no significant difference in the proportion of inferences in 
the other six classes of operations included in the Taxonomy of Text-Based 
Inferences. 
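
The near equality reported in finding 6 can be illustrated from the group means in Table 1. The Python sketch below is only an approximation: it divides the mean number of inferred items by the mean number of total items for each group, which is not the same as averaging each subject's own proportion (the statistic the significance test would have used). The name `inferred_share` is hypothetical.

```python
def inferred_share(mean_inferred, mean_total):
    """Ratio of mean inferred items to mean total items for one group."""
    if mean_total <= 0:
        raise ValueError("mean total must be positive")
    return mean_inferred / mean_total


# Group means as given in Table 1.
GOOD = inferred_share(85.61, 177.50)
POOR = inferred_share(44.78, 92.94)

if __name__ == "__main__":
    print(f"good readers: {GOOD:.3f}")
    print(f"poor readers: {POOR:.3f}")
```

Both ratios come out at roughly .48, so although good readers produced nearly twice as many units overall, just under half of each group's recalled information was inferred.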

Conclusions and Discussion 

Based upon the findings regarding amount and type of explicit, inferred, and 
total information in the recalls, it was possible to conclude that: 

1. Both good and poor readers engage in some constructive or 
inferential processing during reading and recall, with nearly half of the 
information in the retelling being inferred. 

These data support the constructivist view of comprehension which holds 
that the reader selectively processes the input data, using information from 
the text in conjunction with his/her own background of knowledge to construct 
a meaningful interpretation of the material. Because of this interaction between 
previous knowledge and the new information, it was not surprising that as the 
[...] increased, so did the generation of inferred units. 
As Anderson, Reynolds, Schallert, & Goetz (1976) pointed out, in terms of 
schema theory, the main determinant of the knowledge that an individual is 
able to acquire from reading is the knowledge that he/she brings to the task. If 
a poor or inexperienced reader does not possess the [...] 
not know how to bring this schema to bear upon the task, he/she will be unable 
to comprehend the material. 

Table 2 

Mean proportion of inferences in each class 
of inferential operations in the recalls of good and poor readers 

Class of Inferential Operation      Mean Proportion 

Class I Lexical 
   Good                                 .0200 
   Poor                                 .0194 

Class II Identifying 
   Good                                 .2161 
   Poor                                 .2088 

Class III Frame 
   Good                                 .1068 
   Poor                                 .1080 

Class IV Event 
   Good                                 .2011 
   Poor                                 .1961 

Class V Macrostructure 
   Good                                 .0436 
   Poor                                 .1871 

Class VI Algebraic 
   Good                                 .1918 
   Poor                                 .1732 

Class VII Dependency 
   Good                                 .1815 
   Poor                                 .0746 

Class VIII Truth-Value 
   Good                                 .0391 
   Poor                                 .0329 

2. Comprehension is abstractive as well as constructive, with the 
readers choosing to retell what they consider to be the most relevant units 
from the text. 

Even the subject who had the highest explicit recall score remembered only 
half of the total possible concepts and relations in the passage. This provides 
further evidence for the selective nature of comprehension and recall. In 
terms of the constructivist theory of comprehension, it seems that the reader 
selects from the input data those propositions which he/she considers 
relevant in terms of his/her own schematic expectations and background of 
knowledge. 

3. The comprehension process appears to be essentially the same for 
both good and poor readers. Both good and poor readers engage in a 
substantial amount of inferential processing, differing primarily in terms of the 
good readers' superior ability to deal with interpropositional relationships. 

The finding of this study that good readers made significantly more causal 
and conditional inferences provides evidence of their superior ability to use 
schematic expectations of relationships to infer unspecified information. Like 
the truly fluent adult readers in Marshall's (1977) study, the good third grade 
readers were able to infer unspecified relational information. Marshall 
concluded that this awareness of the importance of interpropositional 
relationships and the use of these relationships to organize their recalls is 
indicative of the fluent readers' ability to recreate the author's schema. 

In summary, it is obvious that an extensive amount of inferential processing 
occurs during the processing of even "simple" stories, such as the one used 
in this study. Furthermore, all readers can and do engage in inferential 
processing during reading and recall, as inferencing is a requisite and integral 
part of all language comprehension. Good readers generate more inferred 
information but also recall more explicitly stated information; thus, the 
proportion of inferred information to total information is almost the same in the 
recalls of good and poor readers. Good readers generate more causal and 
conditional inferences between propositions, which adds to the coherency and 
cohesiveness of their recalls, while poor readers tend to substitute more 
general concepts for the specific terms used in the passage, resulting in 
recalls which lack the accuracy and specificity of the recalls of good readers. 
Comprehension profiles of good and poor readers 
across materials of increasing difficulty 

The ability of good and poor readers to adjust their strategies to 
accommodate increasingly difficult material was investigated in this 
[illegible] study. A strategy was defined as a purposeful means of 
comprehending an author's message. Information processing research by 
Goodman (1969), Smith (1973), and Newell and Simon (1972) leads to a view of 
the reader as an active information processor whose interest in 
comprehending the author's message causes him to apply reading strategies. The 
ease with which the reader comprehends may be understood in terms of the 
adequacy of his strategies and the problems set forth in the text. It was 
expected that good readers, given material of varying difficulty, would use 
strategies more frequently with the most difficult material. Research on the 
differences between good and poor comprehenders (Smith, 1967; Golinkoff, 
1975-1976) indicates good readers are more adaptive and flexible in their 
patterns of reading than poor readers. Olshavsky (1976-1977) found that good 
readers apply strategies more frequently than poor readers. Therefore, the 
hypotheses for the study predicted that only the good readers would adjust 
their use of strategies according to the difficulty of the material. 

H1: The frequency of strategy usage by good readers will increase as the 
material becomes more difficult. 

H2: There will be no difference in the frequency of strategy usage across 
the material by poor readers. 

Method 

Design 

The study utilized a 2 x 4 analysis of variance with repeated measures. The 
first variable was reading proficiency with two levels, good and poor; the 
second variable was material difficulty with four levels. 

Subjects 

Twelve eleventh grade students, six good readers and six poor readers, were 
randomly selected from two English classes at a high school in a small 
midwestern city. Reading ability was determined by scores on the 
comprehension subtests of the Stanford Diagnostic Reading Test, Level III. 
Good readers were defined as those who scored at stanines 8-9; poor readers 
scored at stanines 3-4. 

Materials 

Each student read the same material, excerpts from four short stories. The 
stories were selected according to readability (levels 7-8, 9-10, 11-12, and 
13-15 according to the Dale-Chall 1948 readability formula), length 
(approximately 500 words), and writing style (abstract as rated by five out of 
six raters). 
Data Collection 

Each student was instructed to read silently and pause after each 
independent clause (marked by a red dot) to talk about what happened in the 
story and about what s/he was doing and thinking as s/he read. Next s/he was given 
a practice session to familiarize him/her with the procedure and tape recording. 
The session continued until the researcher decided the student was talking 
freely. Finally, the student was given each story in rotated order and was told 
to read each and follow the reading-verbalizing procedure s/he had practiced. 
The student was not prompted by the researcher as s/he read. 

Data Analysis 

Matching protocols and clauses. The students' recorded verbalizations 
were transcribed, and the transcriptions were matched with the 
corresponding clause in the story. An example follows: 

Clause:   He makes no sound at all. 
Protocol: The boy's unconscious. 

Identifying and categorizing strategies. Each protocol, the transcribed 
record of a student's verbalizations about his/her ongoing behavior, was analyzed 
to infer reading strategies. Newell and Simon state, ". . . if the subject employs 
definite processes . . . there may occur enough repetitions of essentially the 
same situation to allow us to induce what the processes are and to have some 
faith in the reality" (1972, p. 191). The process of categorizing students' 
protocols involved three steps. First, the researcher compared each response 
with the text and recorded a description of the student's protocol. Second, the 
researcher decided whether the protocol fit the definition of a strategy. Third, 
each strategy was given a descriptive name if it occurred at least four times. 
The protocol presented in the example above was categorized as a strategy 
because it represented a way of dealing with the author's message. The 
strategy was termed inference because the student added an interpretation of 
the clause. A second scorer classified the protocols of two randomly selected 
students with 95% agreement. 
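
The third categorizing step and the agreement check can be sketched in a few lines of Python. This is an illustrative assumption about the bookkeeping, not the procedure actually used: it supposes that every protocol has already been given a provisional strategy label, and that agreement is the simple percentage of protocols on which two scorers assigned the same label. The function names and sample labels are hypothetical.

```python
from collections import Counter


def named_strategies(labels, min_count=4):
    """Keep only strategy labels that occur at least `min_count` times."""
    counts = Counter(labels)
    return {name: n for name, n in counts.items() if n >= min_count}


def percent_agreement(first_scorer, second_scorer):
    """Percentage of protocols given the same label by both scorers."""
    if len(first_scorer) != len(second_scorer) or not first_scorer:
        raise ValueError("need two non-empty label lists of equal length")
    matches = sum(a == b for a, b in zip(first_scorer, second_scorer))
    return 100.0 * matches / len(first_scorer)


if __name__ == "__main__":
    # Hypothetical provisional labels for a handful of protocols.
    labels = ["inference"] * 5 + ["hypothesis"] * 4 + ["re-reading"] * 2
    print(named_strategies(labels))
    print(percent_agreement(labels, labels[:-1] + ["inference"]))
```

Simple percentage agreement of this kind does not correct for chance matches, so it tends to look most favorable when one category, such as inference, dominates the protocols.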

Testing the hypotheses. Analysis of variance with repeated measures was 
used to test the frequency of strategy usage across the four stories. 

Determining accuracy of paraphrases. The protocols which were 
paraphrases of the clause were scored as accurate if they preserved the author's 
meaning or inaccurate if they distorted the author's message. 

Results 

Reading Strategies 

The eleven reading strategies which were identified are listed in order of 
frequency and defined: (1) inference ([?]25 occurrences) is an addition of an 
interpretation or suggestion to the information in the story; (2) hypothesis 
(63 occurrences) is a prediction of the meaning; (3) stated failure to 
understand a word (54 occurrences) is an identification of the problem of 
inability to understand a word; (4) synonym substitution (48 occurrences) is 
use of a synonym in place of a word in the text during the protocol; (5) personal 
identification (42 occurrences) is drawing personal associations in 
comparison to information in the text; (6) re-reading (22 occurrences) is a 
statement that the clause was read twice; (7) addition of information (22 
occurrences) is addition of factual information to the information contained in 
the clause; (8) skip a word (14 occurrences) is a statement that a word was 
skipped; (9) stated failure to understand a clause (15 occurrences) is an 
identification of inability to comprehend the meaning of a clause; (10) picture 
story ([6?] occurrences) is a statement of a visual image of the story; (11) 
conclusion (4 occurrences) is a statement which summarizes the story. 

Tests of Hypotheses

Analysis of variance based on all strategies found no significant difference
for the first variable, reading proficiency, F(1,10) = 2.05, p > .05. The second
variable, difficulty of material, was significant, F(3,30) = 3.71, p < .05. The
interaction effect was not significant, F(3,30) = [illegible]. The results of
this analysis fail to support the first hypothesis but support the second
hypothesis. The total number of strategies used by both good and poor
readers decreased (153, 147, 81, 82 respectively) as the stories increased
in difficulty.

Comprehension of the Stories

As a check on the effect of the material difficulty variable, an analysis of
variance was performed on the overall number of accurate and inaccurate
paraphrases by good and poor readers for each story. The test revealed that
as the difficulty of the stories increased, the number (23, 18, 36, 6[illegible]) of
inaccurate paraphrases increased, F(3,30) = 5.5, p < .001, and the number
(325, 185, 133, 120) of accurate paraphrases decreased, F(3,30) = [illegible], p < .001.
In neither analysis was the interaction effect significant.

Discussion 

The finding that both good and poor readers used the eleven strategies less
frequently as the stories became more difficult may indicate they judged their
material early in their reading and 'gave up' trying to understand the more
difficult selections. In previous research, it was found that good readers used
strategies more frequently than poor readers. For this reason, it is assumed
that the difficulty of the material rather than the procedure caused these
subjects to have similar profiles of strategy usage. The protocol technique,
though possibly causing some interference with the reading process, has
been used widely in cognitive psychology research because it reveals process
and has provided valuable information about reading. Further research should
investigate whether the process of applying strategies is a necessary part of
comprehension. If strategies are essential, teaching readers to apply them
would be one means of helping them cope with the difficult material they
encounter in high school and college reading.
The relationship between reading ability
and semantic verification tasks



Recently, researchers in psycholinguistics and computer simulation of human
thought processes have addressed themselves to the processes involved in
understanding written discourse. These efforts are particularly apparent in the
area of semantic information processing, one means by which readers gain
information from written discourse. The present study was concerned with the
factors outlined by semantic processing research which may affect the
reading comprehension abilities of good and poor adult readers.

The terminology for defining semantic processing in this study was derived
from proposals of linguists that the universal basis for semantic memory is not
the surface structure of a sentence, despite its effect on the ease or difficulty
of comprehending discourse, but rather the deep structure or conceptual
organization of the discourse. Fillmore (1968) proposed case grammar as a
means of describing deep structure. In this framework the verb serves as the
focus of each thought unit and other concepts within written discourse are
related to it.

Paralleling the work of Fillmore, other researchers attempted to create
computer-simulated programs that would meaningfully process and respond
to English discourse. Frijda (1972) and Minsky (1968) outlined [illegible]
of artificial intelligence programming, proposing that schemata or memory
structures are similar to the processing strategies of the computer when it
responds "intelligently" to human discourse.

The research of linguists and computer scientists converged in Quillian's
(1968) proposal for a hierarchically structured semantic network. This network
proposed by Quillian served as the basis for describing semantic memory
processes for human and artificial intelligence. The hierarchical nature of
such a network was experimentally verified by Collins and Quillian (1969) when
they demonstrated that the time required to recognize a superordinate of a
given noun was a function of the hierarchical "distance" between the stimulus
word and the superordinate. Furthermore, they found that attributes
specifically related to a target noun were more rapidly accessed than were
attributes related to nouns superordinate to the target noun in a semantic
hierarchy. Additional support for a schematic relationship was developed by
Rumelhart, Lindsay and Norman (1972), Meyer (1970), and Collins and Quillian
(1[illegible]).

Within this same period, a considerable body of experimental evidence was
garnered in support of these hierarchical, constructive theories of human
memory. Several studies supported the notion [illegible]
organized relations (Battig & Montague, 1969; Grober & Loftus, 1974; Loftus,
1973; Meyer, 1970; Rosch, 1973, 1975). Other studies demonstrated that
concepts are the basis of memory (Bransford, Barclay & Franks, 1972;
Bransford & Franks, 1971).

*The research reported here is based on the first author's doctoral thesis completed at the
University of Minnesota, Minneapolis, Minnesota, 1977.

Freedman and Loftus (1971), Loftus and Grober (1973), and Schaeffer,
Lewis and Van de Car (1971) determined the developmental nature of
semantic schemata. This finding was important to the present study because it
suggests the possibility that one of the differences between good and poor
adult readers may result from developmental differences in their semantic
encoding and processing strategies.

The goal of the present study was to integrate two paradigms in a single
experiment, combining some traditional reading measures with some novel
semantic processing measures in an effort to determine whether or not either
set of measures (or some unique combination of measures from either or both
sets) would differentiate good from poor readers.

In order to determine the nature and degree of these relations, groups of
good and poor readers were established and presented with tasks similar to
those described in semantic processing research. Four item types, presented as
true-false questions, were posed following the reading of a brief paragraph.
The item types included: integration of information from sentences within the
paragraph; recognition of the superordinate for a noun, neither of which was in
the paragraph (unprimed relationship); recognition of a superordinate for a
noun included in the paragraph (primed relationship); and recognition of a
sentence from the paragraph (verbatim recognition).

Method 

Subjects

Sixty community college students from a two-year suburban Minneapolis
college served as the subjects for the study. All students enrolled in reading
and study skills classes were given the McGraw-Hill Basic Skills System
Reading Test, Form B (Raygor, 1970). A random sample of 30 students
achieving a total percentile score of 73 or above (good readers) and 30
students achieving a total percentile score of 27 or below (poor readers) on
two-year college norms were selected for inclusion in the study.

Materials

A measure to determine levels of information processing consisted of thirty
experimenter-constructed paragraphs. They were constructed from the
category-prototype lists of Rosch (1975).

For each prototype a three-sentence paragraph was constructed. One
sentence presented the prototype plus a descriptor, e.g., "The suit was
denim." A second sentence presented the prototype in relation to another
noun, e.g., "The model wore the suit." In half of these sentences the prototype
appeared as the subject; in half the prototype appeared in the predicate. A
third sentence consisted of the related noun plus a descriptor for that noun,
e.g., "The model was skinny." All sentences were in the active voice. The
order of the sentences within each paragraph was randomized.

For each paragraph four test items were constructed. Each was a sentence
presented following the paragraph and required a true or false response. The
first sentence, based on the work of Bransford and Franks (1971), consisted of
the subject and the related noun plus one of the two descriptors. The subject
descriptor was used half the time, e.g., "The skinny model wore the suit;" the
predicate descriptor the other half, e.g., "The model wore the denim suit." In
the case of false items (10 of 30) the descriptor preceded the incorrect noun,
e.g., "The model wore the skinny suit."



The unprimed relationship item, based on the research of Collins and
Quillian (1969) and Conrad (1972), consisted of a category-prototype
statement; that is, the category and prototype were not mentioned in the
paragraph. In the case of a true item the prototype and category matched,
e.g., "An apple is a fruit." In the case of a false item, prototype and category
did not match, e.g., "An apple is a vehicle." For both true and false items the
category and prototype were other than those serving as the basis for the
paragraph.

A third test sentence expanded Rosch's (1975) use of the prime in that the
category-prototype statement included the prototype discussed in the
paragraph, e.g., "A suit is clothing." For false items (10 of 30) the category
was one of the other nine indicated by Rosch, e.g., "A suit is a vegetable."

The fourth test sentence was a verbatim recognition sentence. Ten of each
of the three sentence types were selected. For true items the sentence
consisted of a verbatim statement from the paragraph, e.g., "The suit was
denim." For false items the descriptor was changed for a noun-descriptor
sentence, and for the noun-direct object sentences a descriptor was placed
before the incorrect noun, e.g., "The suit was skinny."
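
The construction rules described above can be sketched in code. The following is only an illustration, assuming each prototype is stored with its descriptor, category, and related noun; the class name, field names, and helper functions are hypothetical, and only the example sentences come from the text. It builds the three-sentence paragraph and the four true items for the "suit" example.

```python
from dataclasses import dataclass


@dataclass
class Prototype:
    noun: str                # prototype noun, e.g. "suit"
    descriptor: str          # descriptor of the prototype, e.g. "denim"
    category_phrase: str     # category as it appears in a statement, e.g. "clothing"
    related_noun: str        # noun related to the prototype, e.g. "model"
    related_descriptor: str  # descriptor of the related noun, e.g. "skinny"
    verb: str                # verb linking the two nouns, e.g. "wore"


def article(word):
    """Choose 'A' or 'An' from the first letter (a simplification)."""
    return "An" if word[0].lower() in "aeiou" else "A"


def build_paragraph(p):
    """Return the three sentences before their order is randomized."""
    return [
        f"The {p.noun} was {p.descriptor}.",
        f"The {p.related_noun} {p.verb} the {p.noun}.",
        f"The {p.related_noun} was {p.related_descriptor}.",
    ]


def build_true_items(p, unprimed=("apple", "a fruit")):
    """Return one true item per item type; `unprimed` is unrelated to the paragraph."""
    other_noun, other_category = unprimed
    return {
        "integration": f"The {p.related_descriptor} {p.related_noun} {p.verb} the {p.noun}.",
        "unprimed": f"{article(other_noun)} {other_noun} is {other_category}.",
        "primed": f"{article(p.noun)} {p.noun} is {p.category_phrase}.",
        "verbatim": f"The {p.noun} was {p.descriptor}.",
    }


suit = Prototype("suit", "denim", "clothing", "model", "skinny", "wore")
paragraph = build_paragraph(suit)
items = build_true_items(suit)
for item_type, sentence in items.items():
    print(item_type, "->", sentence)
```

False items would be produced by the substitutions the text describes, such as moving a descriptor to the wrong noun or pairing the prototype with one of the other categories.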

The order of paragraphs, and of item types within paragraphs, was
randomized for paired good and poor readers.

Procedure

Each individual student, seated before a rear-screen viewer, was shown the
short three-sentence paragraph. When the student felt he/she understood the
content of the paragraph, he/she pressed a single button causing the screen to
go blank. Four items, representing the four semantic verification tasks, were
then presented one at a time. To each the student responded "true" or
"false" and simultaneously pressed the button. Students were then asked to
indicate their confidence of response on a scale of 5 (high) to 1 (low). After the
four item responses another paragraph was presented until thirty paragraphs
and item sets had been presented. For both paragraphs and items,
presentation of the slide activated a timer that was stopped by the student's
press of the button. The examiner recorded the paragraph or item reading time
after each slide as well as the "true" or "false" response and confidence rating
for the individual items.

For purposes of analysis, the data were organized according to different
layouts appropriate for the several analytical procedures used. Multivariate
and univariate analysis of covariance and analysis of variance for main,
interaction and simple effects as well as t tests of group means were
computed. These analyses are reported in detail by Hanson (1977). For this
presentation only a subset of the results are included.

The dependent variables considered in this study were number of correct
responses on the semantic verification instrument, confidence for correctness
of responses, and latency of correct responses covaried by paragraph reading
time on the semantic verification task.

Analyses for the dependent variables included the following: ranking and t
tests for mean correctness and mean confidence on the semantic verification
tasks; analysis of variance on correctness of response for item type by
true-false by ability; and, for latency of response, multivariate and univariate
analyses of covariance utilizing paragraph reading time on the semantic
verification tasks as the covariate.






As the semantic verification instrument was experimenter-constructed,
instrument characteristics were determined, including an application of the
Kuder-Richardson formula for each of the four 30-point scales representing an
item type as well as for the total 120-item test. The analysis indicated that the
range of item difficulty was from .857 to .987; the reliability of subtest scores
ranged from .655 to .429 and the overall test exhibited a reliability coefficient
of .813.
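
For readers unfamiliar with the reliability index named above, the sketch below computes a Kuder-Richardson coefficient from right/wrong item scores. It assumes formula 20 (KR-20) with the population variance of total scores; the source does not legibly state which Kuder-Richardson formula was applied, and the response matrix shown is hypothetical.

```python
def kr20(responses):
    """KR-20 reliability for rows of 0/1 item scores (one row per subject)."""
    n_subjects = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_subjects
    # Population variance of the total scores.
    variance = sum((t - mean_total) ** 2 for t in totals) / n_subjects
    if variance == 0:
        raise ValueError("Total scores have no variance; KR-20 is undefined.")
    # Sum of p * q over items, where p is the proportion answering correctly.
    pq_sum = 0.0
    for i in range(n_items):
        p = sum(row[i] for row in responses) / n_subjects
        pq_sum += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - pq_sum / variance)


# Hypothetical scores for four subjects on three items.
scores = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
reliability = kr20(scores)
print(round(reliability, 3))
```

The proportion p computed for each item inside the loop is the usual item difficulty index, which is the quantity the text reports as ranging from .857 to .987.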

Since the students were able to control viewing time for both the paragraph
and the four item types that followed, and since items for the semantic
verification task were drawn from high "goodness of example" category-prototype
relationships identified by Rosch (1975) and the sentence and
paragraph structure highly restricted, it was expected that verification of the
truth or falsity of an item would not be difficult. In order to determine if this
were the case two analyses of the data were initiated: the first consisted of
calculating mean number correct, standard deviations, ranks, and t tests
between ability groups for mean number correct for the eight treatment
conditions; the second analysis consisted of computing percentage of correct
answers to true and false item types for each treatment condition. This was
done as there was a possibility of 20 correct true responses but only 10
correct false responses. These converted scores made possible calculation of
analysis of variance to determine if interaction existed among item type,
true-false conditions, and reading ability.
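
The percentage conversion described above can be shown with a short worked example. This is a minimal sketch, assuming each raw count is simply divided by the maximum possible for its condition (20 true items and 10 false items per item type, as stated in the text); the raw scores used are hypothetical.

```python
MAX_CORRECT = {"true": 20, "false": 10}  # maximum correct responses per item type


def to_percentage(raw_correct, condition):
    """Convert a raw number correct to a percentage of the possible maximum."""
    maximum = MAX_CORRECT[condition]
    if not 0 <= raw_correct <= maximum:
        raise ValueError("Raw score is outside the possible range.")
    return 100.0 * raw_correct / maximum


# Hypothetical raw scores for one subject on one item type.
true_pct = to_percentage(18, "true")
false_pct = to_percentage(9, "false")
print(true_pct, false_pct)
```

Raw scores of 18 and 9 look very different, yet both represent 90 percent correct, which is why the conversion is needed before true and false conditions are compared in one analysis.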



Results 



The first analysis, a comparison of means, standard deviations, rankings, and
t values for good and poor readers' correct true, correct false, and total
correct responses, indicated that good readers correctly answered
significantly more items, t(58) = 3.94, p < .001, despite all subjects' ability to
control both the paragraph and item viewing time. Total means were [illegible]
and 108.47 for the two groups. Furthermore, examination of raw scores
revealed that good readers had a higher mean number of correct responses
under all eight treatment conditions.

For true items good readers had significantly more correct items for three
item types: integration, t(58) = 2.89, p < .005, primed category-prototype,
t(58) = 3.02, p < .004, and verbatim recognition, t(58) = 3.68, p < .001, items
that required reading or were at least influenced by the reading situation. The
means for the three item types were 26.77, 24.67; 29.47, 28.53; and 27.93,
25.80, respectively.

Only one item type, verbatim recognition, favored good readers, t(58) =
2.45, p < .017, when the dependent measure was correct identification of false
items. The means were 8.57 and 7.57. The lack of discrimination from the false
items may have been a result of the nonsensical nature of some of them; that
is, some of the items were so absurd that most subjects could reject them on
the basis of logic rather than the passage in which they were embedded.

Ranking of true, false, and item type totals revealed that both good and poor
readers showed identical rankings within item types. Correct items were
rank-ordered for both groups from most correct to least correct as follows:
unprimed category-prototype relationships, primed category-prototype
relationships, verbatim recognition, and integration.

To consider how item responses varied in terms of the ability groups, true as
compared to false items, and item types, and to determine if interaction
between these factors existed, analysis of variance was carried out on
percentage conversions of the raw scores. One between factor, ability, and
two within factors, true versus false, and the four item types were considered.

Results indicated that while all three main effects achieved significance at
the .01 level, there were also two interactions that were notable: ability by item
type, F(3,174) = [illegible], p < .01, and true-false by item type, F(3,174) = 27.92,
p < .01. The first interaction demonstrated that while reading-required item
types were more difficult than non-reading-required item types for both ability
groups, they were a great deal more difficult for poor readers. However, the
second interaction between true-false conditions and item type complicated
the interpretation of the main effects previously indicated in that most of the
variation in these two factors stemmed from two extremely depressed cell
means: false verbatim recognition and false integration items.

The percentage of errors for totaled item types was 5.05% for good readers
and 9.16% for poor readers. Some subjects made no errors on certain item
types, resulting in insufficient data on which to make judgements regarding
incorrect responses. The attention to correctness through trials, and request
for confidence rating of responses, seemed to have effectively encouraged
subjects to concentrate on correctness. For these reasons analysis of
confidence ratings and latencies focused on correct items only.

Despite the ability of subjects to control the length of time they had to view
the paragraphs and the items, differences in response correctness were noted
on particular items. In order to determine if these differences might be a result
of overreaction or a thoughtless response, subjects were asked to indicate
how confident they were of their correctness.

For both good and poor readers there was a high degree of confidence for
their correct responses, all confidence means being at 4.61 or above. Ranking
the mean true and mean false confidence indicated that for both good and
poor readers the order of highest to lowest confidence was identical to that for
greatest to least in correctness of response.

To determine differences in processing, mean latencies for the eight
treatment conditions for each ability group were calculated and, as would be
expected, good readers responded more rapidly in completing the reading of
the paragraphs and in correctly verifying items under the eight treatment
conditions.

To attenuate reading speed as a confounding variable in the consideration
of the data, subjects' response times to the items were covaried by paragraph
reading time and the resulting times considered by multivariate analysis of
covariance, which simultaneously compared all four response times of good
and poor readers to determine if differences did exist between the two groups
for the adjusted latency measure.

The adjusted means resulting from the multivariate analysis of covariance
reflect latencies that significantly favor good readers for both true, F(4,54) =
4.10, p < .006, and false, F(4,54) = 4.80, p < .002, items. Not only were good
readers faster than poor readers in their overall processing speed, but they
also appeared to be faster in the completion of semantic verification tasks.

A second univariate analysis of covariance of latencies compared good and
poor readers on each of the eight treatment conditions (four item types,
true-false). Good readers responded more rapidly to semantic verification tasks in
seven of the eight treatment combinations of levels; the one exception was
false verbatim recognition.

Conclusions 

As the attempt of this study was to examine relationships between two fields
of study whose common bases had not yet been established, limitations must
be recognized in interpreting the results. Certain tentative conclusions,
however, are warranted.

Good adult readers do exceed poor adult readers in their ability to respond
correctly to semantic processing tasks that require comprehension of written
discourse. It would appear that good readers readily integrate new learnings
acquired through reading into their conceptual framework while poor readers
deal with the reading task as a singular presentation of information unrelated
to what they already know.

In terms of the item types themselves, true items favored good readers for
three of the four tasks, the exception being that of unprimed category-prototype
relationships. The differences that were found were not a result of
confusion or lack of attention to the task. This was established by the few
differences in confidence for correct responses by item type.

Analyses of the latency scores for correct responses utilizing paragraph
reading time as a covariate indicated that good readers responded to both true
and false items more rapidly than poor readers. Additional analyses, again
using paragraph reading time as a covariate, demonstrated that good readers
responded more rapidly in terms of semantic verification to seven of the eight
item type true-false tasks.

These findings demonstrated that there were statistically significant
differences favoring good readers on a majority of the semantic verification
tasks. It would appear that the developmental differences in semantic processing
favoring good readers, established by several writers, have been
substantiated with adult subjects by the results of this study.
Perceptual versus semantic information
processing in semantic category decisions

Rosch (1975) has developed a paradigm for examining the use of advanced
information in making semantic category decisions. Two features of this
paradigm are worth discussing in detail. First, Rosch has developed a set of
norms for "level-of-goodness" for the semantic categories she uses. That is,
each category is represented by exemplars which have been rated on the basis
of how well they fit the categories. For example, in the category FRUIT, 'apple'
is a high level-of-goodness exemplar while 'kumquat' is a low level-of-goodness
exemplar. The level-of-goodness norms seem to be stable and reliable as
indicated both by Rosch's work and that of Childrey and Kamil (1977).

Second, Rosch uses a decision task which incorporates advanced information,
known as "priming." The decision task is to make "same category" or
"different category" judgments for pairs of exemplars from the categories. On
half of the trials, advanced information is provided in the form of the name
of a category relevant to the word pair. On the remainder of the trials no
priming information is provided, in which case the test pair is preceded by the
word 'BLANK' instead of a category name. Three types of word pairs are
included: physically identical pairs, categorically identical pairs, and different
category pairs. Physically identical trials contain the same word repeated, for
example, 'Apple ** Apple'. Categorically identical pairs have two different words
from the same category, for example, 'Apple ** Orange'. Different pairs
have two different words taken from different categories, at the same approximate
level-of-goodness, for example, 'Car ** Apple'. Subjects respond "same" to
both physically identical and categorically identical pairs by pressing one key and
"different" to the different category pairs by pressing another key. The dependent
measure was reaction time (RT) to press the correct key.
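
The three pair types and the two response keys described above can be expressed as a small decision rule. The sketch below is illustrative only: the category lookup table is a hypothetical stand-in for Rosch's norms, and the function names are not taken from the study.

```python
# Hypothetical category membership; a real experiment would use Rosch's norms.
CATEGORY_OF = {
    "apple": "FRUIT",
    "orange": "FRUIT",
    "car": "VEHICLE",
}


def pair_type(first, second):
    """Classify a word pair as one of the three trial types."""
    a, b = first.lower(), second.lower()
    if a == b:
        return "physically identical"
    if CATEGORY_OF[a] == CATEGORY_OF[b]:
        return "categorically identical"
    return "different category"


def correct_response(first, second):
    """'same' for both kinds of identical pair, 'different' otherwise."""
    return "different" if pair_type(first, second) == "different category" else "same"


for pair in [("Apple", "Apple"), ("Apple", "Orange"), ("Car", "Apple")]:
    print(pair, pair_type(*pair), correct_response(*pair))
```

Note that the two kinds of "same" pair call for the same key press, so they can only be distinguished in the analysis by trial type, not by the response itself.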

Rosch found that level-of-goodness was related to latency of decision for
categorically identical pairs, with high level-of-goodness producing faster RTs.
Priming also produced faster RTs for categorically identical pairs. However,
when the pairs were physically identical, priming interacted with
level-of-goodness. These results have been replicated and extended by Hanson &
Kamil (1977).

The explanation Rosch applied to these results was that subjects generated
some "aspect of meaning" on primed trials. A semantic aspect of meaning
simply facilitated processing of primed categorically identical pairs as a
function of level-of-goodness. For physically identical pairs, the interaction
occurs because the perceptual aspect of meaning is more like good exemplars
(facilitating decisions) than poor exemplars (inhibiting decisions).

Hanson and Kamil (1977) attempted to replicate the Rosch study. They
found differences in processing of physically identical pairs, dependent on the
modality in which advanced information was presented. In the original Rosch
study, primes were presented auditorily. When primes were presented
visually in the Hanson and Kamil study, the interaction for physically
identical (PI) pairs was not obtained. These results indicate that the facilitation
and inhibition of different level-of-goodness pairs involve the use of visual
centers. The PI results can be understood as a visual biasing in favor of a
[illegible] altered for semantic (categorically identical) decisions. This
indicates that [illegible] processing (cf. Hanson, Kamil & Snyder, 1978).

Method 

Sixteen junior high school students participated in the study. Eight were
selected from remedial reading classes and eight were selected from
non-remedial English classes. All students were paid for participation in the
experiment. Sixteen college students also participated. They received course credit.
The independent variables in the normal Rosch paradigm are 1) Type of
word pair (physically identical, categorically identical, or different);
2) Level-of-goodness (high, medium, or low); and 3) Advanced information
(primed or unprimed). In this study modality of advanced information (visual or
auditory) was also manipulated. Finally, reading ability (college, good or poor)
was another variable. In all cases the dependent variable is latency of decision.

Experimental sessions lasted about one hour. Each participant received 196
trials, half primed and half "blank", equally divided between "same" and
"different". In the present study, stimuli were presented on a cathode ray tube
display controlled by an ADS 1800E computer. Reaction times to make decisions
were measured and recorded automatically.



Results 



For categorically identical trials, college students show priming advantages.
Good readers show an advantage only when the priming information is
auditory. Poor readers do not show any advantage of priming information.

For physically identical trials, college students show an interaction between
priming and level-of-goodness for auditory information. Neither good nor poor
readers [illegible].

[illegible] than junior high school students, and good
readers are faster than poor readers.



Discussion 



These results reveal some interesting "developmental" trends over the range
of students tested. Examination of Figure 1 makes these trends most apparent.
Within the interpretive framework of semantic (automatic) and perceptual
(strategy) effects, college students and good readers show similarities for
semantic effects in both auditory and visual conditions. This suggests that
both of these groups have developed a well-organized semantic memory
network which is reflected by level-of-goodness norms and decision latencies.
Of the two modality conditions, however, the auditory condition is easier for
good readers and the results for this group match college students more
closely. On physically identical trials, the auditory condition shows good readers
developing the necessary level-of-goodness effect. However, they do not seem
to differentiate their perceptual strategy for primed and unprimed trials to
yield the physically identical interaction.

Poor readers show neither the perceptual nor the semantic effects. Thus,
the major implication of the present data is for a general trend in reading
development. First a stable semantic category structure is established. Then, by
college level, perceptual strategies based on that structure are developed.
Junior high school students are still developing the category structure (in
terms of level-of-goodness) found in college students. The poor readers clearly
showed reaction times which were very slow compared to good readers.
They were slower still for visual primes than for auditory primes. Good readers
are further along in reading development than are poor readers. Specifically,
good readers seem to be better than poor readers at using advanced
information for facilitating semantic decisions and adopting perceptual strategies.

While the Rosch task used in this study does not involve the entire spectrum
of reading skills, it does isolate and examine two skills important in reading:
semantic and perceptual processing. Since poor readers have difficulty with
this relatively simple semantic task, it would be expected that they would have
even more difficulty with the more complex semantic demands of a reading
task. Similarly, poor readers have more difficulty than good readers in the
perceptual aspects of this task, though both groups are poorer than college
students. Because perceptual strategies are based on a stable semantic
structure, it may be that junior high school students need development of
their semantic structures and their consequent perceptual strategies. Additional
work will be needed to determine 1) whether these findings are true when larger
units (than words) are used and 2) whether these differences found between
good and poor readers can be remediated by conventional methods.

References 



Brooks, L. R. The suppression of visualization by reading. Quarterly Journal of Experimental
    Psychology, 1967, 19, 289-299.
Brooks, L. R. Spatial and verbal components of the act of recall. Canadian Journal of Psychology,
    1968, 22, 349-368.
Childrey, J. A., & Kamil, M. L. A preliminary examination of the relationship between category
    structure and reading. Paper presented at the annual meeting of the National Reading
    Conference, New Orleans, December 1977.
Hanson, R. H., & Kamil, M. L. Modality specific effects in semantic category matching. Paper
    presented to the Midwest Psychological Association, Chicago, May 1977.
Hanson, R. H., Kamil, M. L., & Snyder, C. R. R. Attentional versus automatic processing in
    semantic category decisions. Presented to the Midwestern Psychological Association, Chicago,
    1978.
Rosch, E. Cognitive representations of semantic categories. Journal of Experimental Psychology:
    General, 1975, 104, 192-233.






PRISCILLA A. DRUM
University of California, Santa Barbara 



Prose recall responses and categories for scoring

The recall of what has been read contains responses that closely
approximate the original and responses that vary on some undefined
continuum of difference from the original. Much of the experimental research
on reading comprehension indicates that competent readers recall the
substance of the passage, e.g., the text base (Kintsch, 1975, 1976), the
redundant information (Drum, 1974, 1976), the hierarchical structure (Meyer &
McConkie, 1973; Filby, 1975), and the story structure (Thorndyke, 197[illegible]).
These studies used various analysis systems for determining the structure of
the passages; the systems then become templates for scoring the recall
responses since the major interest or outcome was to determine the
correspondence between the passages and the recall.

However, the constructive view of memory (Bransford & Franks, 1973)
postulates an integration of fragments within a text with a plan for what must
have occurred based on generalized knowledge of the world. Interference can
come from inadequate or inappropriately used knowledge or from selecting
nonessential text fragments. Deviations from the text do occur even with
college undergraduates, the typical subject for these studies. Recall for "The
War of the Ghosts" (Bartlett, 1932) and for the biblical story "Joseph and His
Brothers" (Kintsch, 1976) include content additions, which were explained as
prefamiliarity with the "Joseph" story or as fitting the "Ghost" information
into expectations by the reader for a cultural norm. These text deviations and
some of the generated responses in the Cofer (1973) study seemed
reminiscent of the classroom recall of young children after reading a story.
Text recall is not just the reader's creation, but an adaptation of the
information in the text to his own prior memorial structure for similar
information. Responses are interactions between what was known before
reading and what the text said.

A purpose of this study is to describe the difference response. The first step
is to separate text responses from difference responses. To do so, the original
passages are analyzed into a hierarchy of propositions using procedures similar
to Kintsch's (1975) for deriving the text base. This consists of "a structured
list of propositions. A proposition consists of a predicate with one or more
arguments: arguments are concepts or propositions themselves" (Kintsch,
1977). The attempt is to delineate the meaning relationships of the text, which
can be realized with different function words and different orderings, i.e.,
active to passive alterations, or progressive to active changes. The linear
ordering of propositions in a hierarchy established by argument repetition
describes a structure of the passage, which is used as a template in scoring
textual responses that adhere to the meaning but deviate somewhat in
syntactic and function word expression.
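The quoted definition is recursive, since an argument may itself be a proposition. A minimal Python sketch of such a structure is given here for illustration only; the class name, field names, and the example sentence are assumptions and are not part of Kintsch's notation or of the scoring procedure used in this study.

```python
from dataclasses import dataclass
from typing import Tuple, Union

# Hypothetical representation: an argument is either a concept (a string)
# or another proposition, following the quoted definition.
Argument = Union[str, "Proposition"]


@dataclass(frozen=True)
class Proposition:
    predicate: str
    arguments: Tuple[Argument, ...]

    def concepts(self) -> set:
        """Collect every concept mentioned, descending into embedded propositions."""
        found = set()
        for arg in self.arguments:
            if isinstance(arg, Proposition):
                found |= arg.concepts()
            else:
                found.add(arg)
        return found


def shares_argument(a: Proposition, b: Proposition) -> bool:
    """Argument repetition: True when two propositions mention a common concept."""
    return bool(a.concepts() & b.concepts())


# Invented example. Active and passive wordings map to the same proposition:
# "The settlers built the cabin." / "The cabin was built by the settlers."
build = Proposition("BUILD", ("SETTLERS", "CABIN"))
need = Proposition("NEED", ("SETTLERS", "SHELTER"))
because = Proposition("BECAUSE", (build, need))
```

In this sketch, `shares_argument(build, need)` is true because both propositions mention SETTLERS; that kind of overlap is what "argument repetition" refers to when propositions are linked into a hierarchy.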

The protocols are also analyzed into propositions where possible. There are
often parenthetical remarks and incipient phrases which cannot constitute a
proposition and, therefore, a word count rather than a proposition count is
used. Also, since the students do not have time for editing and revision, their
hierarchical structures are not described.

Four ordinal categories have been developed to encode responses on a
continuum from text specific to text external. The categories are global in that
each contains subcategories with obvious explanatory power which are
summed together. The categories are:

1) Text specific responses: The units scored as text specific must correspond
with the text propositions unambiguously. They are restatements of text
propositions, including acceptable syntactic paraphrase, synonymy of elements
with the same conceptual referent, and substitutions of pronouns appropriately
referenced elsewhere.

2) Text entailed responses: The units scored as text entailed include text
specific elements put together in new ways and additions of information that
are semantically entailed by the text and are correct according to a content
expert.

3) Text evoked responses: The units scored as text evoked also include elements
of the text, but they are either inappropriate recombinations, additions of
information external to text elements, or general statements that do not convey
any specific information.

4) Text external responses: The units scored as text external either have no
relationship to the text, such as storytelling conventions and self-report
comments, or are repetitions of previously recalled statements.
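The bookkeeping behind this scheme can be sketched in Python as follows. This is only an illustration under stated assumptions: the category assignment itself is a human judgment, the function and variable names are hypothetical, and the protocol fragment is invented. The sketch sums words per category, since a word count rather than a proposition count is used for the protocols.

```python
from collections import Counter
from enum import IntEnum


class Category(IntEnum):
    # Ordered from text specific (1) to text external (4).
    TEXT_SPECIFIC = 1
    TEXT_ENTAILED = 2
    TEXT_EVOKED = 3
    TEXT_EXTERNAL = 4


def words_per_category(scored_units):
    """Sum word counts per category.

    scored_units: iterable of (unit_text, Category) pairs in which the
    category has already been assigned by a human scorer.
    """
    totals = Counter({category: 0 for category in Category})
    for unit_text, category in scored_units:
        totals[category] += len(unit_text.split())
    return dict(totals)


# Invented protocol fragment, for illustration only.
protocol = [
    ("gears turn the shaft", Category.TEXT_SPECIFIC),
    ("so the engine can move the wheels", Category.TEXT_ENTAILED),
    ("it was about machines and stuff", Category.TEXT_EVOKED),
    ("I don't remember the rest", Category.TEXT_EXTERNAL),
]
totals = words_per_category(protocol)
```

Using an ordered enumeration keeps the ordinal character of the categories, so a score can be compared or sorted along the continuum from text specific to text external.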

The four categories and the procedures for scoring are described in more
detail in Drum and Lantaff (1977).

Method 

Sixteen eighth grade pupils, half of them able readers and half below
average in reading ability, read four passages. Ability was determined by
scores at or above the 90th percentile or below the 50th percentile on the SRA,
Form E, reading comprehension section. The subjects also represented equal
numbers of boys and girls. After reading a passage, each student was asked to
recall the information orally. A day later, delayed recall for two passages was
taken; a week later, for the remaining two. The cue for delayed recall was the
one-word title provided with each passage. All oral responses were taped. Each
subject knew in advance that he would be asked for recall of the information
by title both immediately after reading and either a day or a week later. They
were also told to read as if they were reading a school assignment. Nobody
was apparently attempting to rote memorize the material. The recorded
reading times averaged 86 seconds, too brief for attempted memorization of
the passages.

The four passages (252 to 275 words) were selected from science and social
studies texts. The Fry readability levels for the passages were from 7.4 to 8.0.
The passages were analyzed into propositional hierarchies, with four to seven
levels in the hierarchies and 123 to 147 propositions.

The design was a repeated measures, mixed factorial with ability and sex
as between-subject variables and type of passage, passage, and amount of delay
as within-subject treatment factors.

The immediate and delayed recall responses were analyzed into
propositions, with function words counted as they appeared in the
propositions. Every propositional unit was then given one of the four category
designations. Each recall response was scored by three people. There was
substantial agreement among scorers, approximately .80: above .95 for text
and external and .60 to .70 for entailed and evoked. Any unit about which the
scorers disagreed was reanalyzed until agreement could be reached. Then the
words per category per passage became the scores for further analysis.



The significance of the category distinctions by ability, by sex, by type of
passage, by particular passage within type, and by amount of delay was
tested using analysis of variance procedures.



                                  Table 1
      Means and Standard Deviations by Ability for Protocol Categories

                 Total       Text      Entailed     Evoked     External
Ability         M    SD     M    SD     M    SD     M    SD     M    SD    (N)

At Immediate Recall
High           141   50     50   26     4?   20     18   15     23   14    (32)
Low             87   37     17   15     26   11     26   17     18   11    (32)

At Delayed Recall
High           115   43     34   19     45   21     17   10     19   12    (32)
Low             69   33      8    6     26   18     22   13     13    8    (32)

Results & Discussion











The major category means indicate that able readers remember more text
responses at both immediate and delayed recall, F (1, 12) = 9.8, 12.2, p < .01,
and more entailed responses at immediate recall, F (1, 12) = 7.5, p < .05.
Less able readers have more evoked responses, particularly for science
passages, at the immediate recall test point. Delayed recall has on the average
fewer propositions stated, but what is retained is generally similar to
immediate recall. The proportionate stability of entailed and evoked
responses from immediate to delayed recall, as compared with the decrease in
textual recall, indicates that reconstructed responses are more likely to be
remembered over time.

The ability by type of passage interaction for evoked responses, F (1, 12) =
4.9, p < .05, for immediate recall is due to an increase in general responses by
less able readers for the science passages (science mean of 28 versus social
studies mean of 23), while able readers have fewer evoked responses for
science (mean of 16) than for social studies (mean of 20). Less able readers
are less likely to make delayed entailed responses for science passages (a
mean of 13) than for social studies (a mean of 38), F (1, 12) = 17.6, p < .01.
Able readers do not differ. The only other interaction for ability is that less
able readers make fewer external comments for the two passages "Engine" and
"Women"; they also have fewer text and entailed remarks and more evoked
for these passages. However, able readers also make fewer text and more
evoked responses for the same two passages, but they also increase their
external comments, F (1, 12) = 11.9, p < .01. Able readers may be simply
inclined to make more statements. If they do not remember much of the passage,
they are likely to make more comments about their poor performance.
Science passages were more difficult for these children at both test points,
with fewer text responses for all readers, F (1, 12) = 27.0, 17.3, p < .01. Though
readability was approximately the same for all four passages, reading and life
experience with methods for conserving heat in buildings and with the way
gears work in operating machinery is likely to be less for eighth graders than
with descriptions of the household work of women on the frontier or the
problems of an explorer in the new world. The science passages are technical,
providing information about inanimate things. The social studies passages are
about people. Human actions and reactions are familiar events, and readers can
use their prior knowledge about people to structure the information and thereby
remember it.

The results for "Engine" exemplify the possible effects of experience on
learning from reading. More boys than girls may have tinkered with gears, not
an innate predisposition but a cultural phenomenon. Though the differences are
not significant, girls of both ability groups generally have more text specific
and fewer evoked responses. For "Engine" this pattern is reversed. Text entailed
responses are identical for able readers generally, while low girls have more of
such responses than low boys. For "Engine" again the pattern is somewhat
reversed; boys of both ability groups generally outperform the girls in the
respective groups. In delayed recall the major change is more entailed
responses by able boys versus able girls, a switch from their general
performances.

"Engine" is the most technical article. It requires more prior experience to
comprehend. There are phrases with two underlying meanings: for example,
the crankshaft gear can mean it is a gear or it has a gear. General reading
ability is the most potent factor in learning from text, but experience also
makes a contribution; at least, experience is inferred to be the source of the
difference between girls' high performance generally and relatively low
performance on "Engine."

Explorations into the interactions between experience and the content of
material are indicated. Blocking by prior knowledge of the content as well as
by reading ability, and examining both the text and the difference responses,
may aid in determining the components of learning from text and the
processes involved in reading comprehension.

The categorical analysis applied to subject recall responses does indicate
differences between good and poor comprehenders. Though each student's
responses varied as a result of the different passages, the categorical
pattern of more text and entailed responses for an able reader remained
constant over passages. Categorical difference scores for each subject for each
passage were also calculated. Able readers were more likely to bring together
information from all parts of the passage, to make text derived inferences, and
to add case-linked arguments. Less able readers seemed to divide into two
groups: those who stated a text proposition and then repeated it, and those who
picked up one idea and then elaborated on it, though there was no apparent
relationship to the text in the elaboration. Since all of the passages had a
readability close to grade level, one would have to see if the recall patterns
held for simpler materials for the less able and harder materials for the good
readers. If replicable and replicated, categorical analysis could help define
what is meant by reading comprehension.
Two factors affecting text recall

A useful approach in discussing reading comprehension is to view it as a
process of acquiring information. During this process, the reader utilizes
surface and base characteristics of the text as well as his internalized
psychological processes and existent knowledge in deriving a meaning from
this written text (Kintsch & Vipond, 1977). Mature readers reconstruct their
version of the author's intended message in part by utilizing the text as a kind
of processing plan (Freedle & Carroll, 1972), and then integrating or
assimilating this acquired information to existing cognitive schemata.

According to the "levels of processing" framework proposed by Craik and
Lockhart (1972), the nature and duration of a memory trace is determined by
the kind of processing that is performed on input; an analysis of the recall that
readers produce for texts they have read does in many ways seem to reflect
the processing that has occurred. The versions readers reconstruct usually
include information that is specific to the text and inferences that were implied
but not explicitly stated in the text, as well as information conceptually related
to the text that seems to intrude into the subject's recall from existing
knowledge. Consequently, it is suggested that a close look at the recall
characteristics for differing passages may provide clues which enable us to
make inferences about the processes of comprehension as well as the text
variables which influence this processing.

This study examines the nature of text recall obtained at different points in
time for passages varied by subject and reading difficulty. In addition, a nested
condition investigates the effect of delaying initial recall on subsequent recall.

Method

Subjects

The subjects for this study were 16 graduate students from the University of
California, Santa Barbara Graduate School of Education.

Materials

Five passages were selected from science or social studies texts designed
for fifth grade students and college undergraduates. These were texts
currently available for use in the public schools. Each selection was
approximately 250 words long, with a readability between 5.0 and 5.8 for the
easy and between 12.8 and college for the difficult passage conditions: a hard
science, a hard social studies, an easy science, and an easy social studies. The
easy social studies condition contained two passages because of the nested
condition. The order of the four conditions was systematically varied for
presentation, with the position of the two easy social studies passages
counterbalanced within the design so that the passage which was not to be
recalled at the immediate recall session always occurred in the fifth position.
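One way such an ordering could be produced is sketched below in Python. The text says only that the order was "systematically varied" and that the two easy social studies passages were counterbalanced; the cyclic rotation, the alternation rule, and the passage labels "A" and "B" are assumptions made for illustration, not the authors' documented scheme.

```python
CONDITIONS = ["hard science", "hard social studies", "easy science", "easy social studies"]
# Hypothetical labels for the two easy social studies passages.
EASY_SS_PASSAGES = ["easy social studies A", "easy social studies B"]


def reading_order(subject_index):
    """Return the five-passage reading order for one subject (0-based index)."""
    # Cyclic rotation of the four conditions across subjects.
    shift = subject_index % len(CONDITIONS)
    rotated = CONDITIONS[shift:] + CONDITIONS[:shift]
    # Switch the held-back passage after each full cycle of rotations.
    held_back = EASY_SS_PASSAGES[(subject_index // len(CONDITIONS)) % 2]
    recalled = EASY_SS_PASSAGES[1 - EASY_SS_PASSAGES.index(held_back)]
    order = [recalled if c == "easy social studies" else c for c in rotated]
    # The passage not recalled immediately always comes fifth.
    return order + [held_back]


orders = [reading_order(i) for i in range(16)]
```

With 16 subjects this rotation places each condition in each of the first four positions four times and holds back each easy social studies passage for eight subjects.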



Text Analysis

Stimulus passages were selected and then decomposed sentence by
sentence into the clausal units contained within. These clausal units, or
"clausal propositions," were composed of a predicate and n-tuples of
arguments. These functioned as constituents of a subject or predicate term. A
distinguishing characteristic of these "clausal propositions" was that their
predicate term was marked by a grammatical verb in the text's surface
structure.

Once the sentence was parsed into clauses, these were arranged
hierarchically according to a specified set of rules for assessing clause
relations, based primarily upon the sentence-internal coordination,
subordination, or complement relationships and levels of argument generality.
After the first sentence was so arranged, the next sentence was analyzed in a
similar manner. These clusters of propositions were then connected on the
basis of the relationship between the head or nucleus (Longacre, 1977)
proposition for each cluster. If the head of the second cluster acted to further
describe, specify, or elaborate on the information of the first, it and all its
dependent or marginal propositions assumed a lower position in the hierarchy.
If, on the other hand, this second proposition introduced essentially new
information that functioned as topical or thematic development, as opposed to
propositional elaboration, then the head propositions assumed positions on
the same hierarchical level. Each successive sentence was then analyzed and
arranged accordingly.
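The connection rule for cluster heads can be illustrated with a short Python sketch. It is a simplification under stated assumptions: the relation labels "elaborates" and "new_topic" are hypothetical stand-ins for the analyst's judgment, each head is compared only with the head immediately before it, and nothing in the sketch moves back up the hierarchy, a case the description above does not cover.

```python
def assign_levels(clusters):
    """Assign a hierarchical level to each cluster head, in text order.

    clusters: list of (head_label, relation) pairs. relation is a
    hypothetical label for how a head relates to the preceding head:
    "elaborates" or "new_topic". The first cluster's relation is ignored.
    Returns a list of (head_label, level) pairs, with level 1 at the top.
    """
    levels = []
    current = 1
    for index, (head, relation) in enumerate(clusters):
        if index == 0:
            current = 1
        elif relation == "elaborates":
            # Further describes or specifies the previous head: one level lower.
            current += 1
        elif relation == "new_topic":
            # Topical or thematic development: same level as the previous head.
            pass
        else:
            raise ValueError(f"unknown relation: {relation!r}")
        levels.append((head, current))
    return levels


# Invented sequence of four sentence clusters.
example = assign_levels([
    ("S1", None),
    ("S2", "elaborates"),
    ("S3", "new_topic"),
    ("S4", "elaborates"),
])
```

The dependent or marginal propositions of each cluster would sit below their own head; they are omitted here to keep the sketch short.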

Procedure

Each of the passages selected was typed and randomly assigned to the
students at the initial reading session. They were instructed to read each
passage twice, first skimming it to get the general idea and then going back to
read the passage more carefully. They were asked to try to read these
passages in the same way they would for a class assignment. In addition, they
were asked to record the times they began and finished reading.

When the subjects finished reading a story, they turned the passage over
and, on a separate sheet which contained only the title of the preceding story,
wrote down everything that they could remember. The subjects had unlimited
time for recall. Following the reading of the fifth passage, the subjects were
informed that no recall on this passage was required.

Forty-eight hours later, the group was again asked to recall the passages.
For this session, as well as the third, the subjects were asked to recall all five
of the passages. The third and final recall session took place one week after
the initial reading session. Unlike the previous sessions, in which subjects
were furnished only with the passage title as a prompt, the "one week" recall
sheets each contained four noun "cues" selected from the appropriate
stimulus passage. Subjects were requested to recall all that they could about
the cues and then to add whatever else they could remember about the story.
This produced three recall conditions: an immediate free, a 48-hour free, and a
one-week cued recall condition.

Following these sessions, the protocols were analyzed in a manner similar to
the texts and partitioned into lists of clausal propositions. These were not
arranged hierarchically, however. Each of these recall propositions was then
compared to the corresponding text list and classified as belonging to one of
three response categories. (There are actually five categories in our analysis
[see Drum & Lantaff, 1977], but two did not prove appropriate for college
students with written protocols.) The recall was scored as either text specific
(verbatim propositions, allowing for synonym substitution); text entailed
(propositions which are bounded by the text, containing specific arguments
and predicates arranged in new ways); or text evoked (intrusions and
generalizations).

Results and Discussion

A summary of recall category means is presented in Table 1.

Table 1

Means and Standard Deviations for Response Categories by Session

Response Categories:    Total           Text-specific    Entailed        Evoked

Session 1               16.58 (6.49)    9.66 (6.05)      6.38 (2.60)      .56 ( .97)
Session 2               11.92 (5.77)    5.22 (4.61)      5.81 (3.08)      .91 (1.18)
Session 3               11.33 (5.49)    4.20 (3.48)      6.09 (2.84)     1.03 (1.26)



As might be expected, there is a distinct drop in the total number of clausal
propositions remembered between the immediate and 48-hour conditions,
with the cues (four nouns) acting to stabilize recall between 48 hours and a
week. There are also qualitative changes in the types of information
recalled. There is essentially a trade-off between the text specific
and text entailed categories; whereas the amount of specific recall
diminishes, the amount of entailed recall shows proportionate gain. It seems
that while surface level pragmatic relations fade from memory fairly quickly,
subjects are still able to reproduce text-relevant information. This finding
would be in line with a constructivist interpretation of memory.
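The proportionate shift can be checked against the Table 1 means with a few lines of Python. This is only an arithmetic illustration: the variable and function names are hypothetical, the Session 2 total is taken as 11.92 (which agrees with the sum of its three categories within rounding), and the shares are ratios of group means rather than means of individual subjects' ratios.

```python
# Session means (clausal propositions recalled) copied from Table 1.
SESSION_MEANS = {
    "Session 1": {"total": 16.58, "text_specific": 9.66, "entailed": 6.38, "evoked": 0.56},
    "Session 2": {"total": 11.92, "text_specific": 5.22, "entailed": 5.81, "evoked": 0.91},
    "Session 3": {"total": 11.33, "text_specific": 4.20, "entailed": 6.09, "evoked": 1.03},
}


def category_shares(means):
    """Express each category mean as a percentage of the session total."""
    total = means["total"]
    return {
        name: round(100 * value / total, 1)
        for name, value in means.items()
        if name != "total"
    }


shares_by_session = {
    session: category_shares(means) for session, means in SESSION_MEANS.items()
}

for session, shares in shares_by_session.items():
    print(session, shares)
```

On these figures the text specific share falls from roughly 58 percent of the session total at Session 1 to about 37 percent at Session 3, while the entailed share rises from roughly 38 percent to about 54 percent, even though the entailed mean itself stays close to six propositions.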

The low amount of evoked recall is also of interest. In previous studies,
researchers have found that information seemingly related to a subject's
previous experience frequently intrudes into recall protocols when the
passage content is highly familiar to the subjects and/or the time increases
between reading and recall (Kintsch, 1974). This was not the case for these
students.

The topics of these passages were such that normal readers would be
expected to have well developed conceptual bases due to prior experience.
Students should, it would seem, "ad lib" quite a bit as memory for these texts
fades; however, the responses were quite conservative. It seems that when the
memory for text specific relations gets weak, mature readers still maintain a
"sense" of the text, what Kintsch has called "macrostructure". These
students were reasonably sure of what they had not read and edited or
constrained their recall accordingly.

Passage Differences

Analysis of variance for the total recall of the passages yielded a significant
difference for reading difficulty of texts, F(1,15) = 4.91, p < .05, but not for
the texts of different types (science vs. social studies). There was, however, a
significant interaction between text type and text difficulty, F(1,15) = 4.57,
p < .05. The primary factor contributing to this interaction is apparently the
large recall difference between the science passages, which does not occur
between the social studies texts. The two social studies texts are quite distinct
in terms of their measured reading difficulty and yet are remembered with
close to equal effectiveness. On the other hand, the two hard texts are similar
in readability scores but are remembered unequally. This raises some
question about formula-measured readability and its relationship to
memorability and comprehensibility. It seems that if these formulas are
measuring lexical and grammatical complexity, the frequent practice of
implying that complexity equals difficulty, without considering other reader
and text variables, may prove to be a precarious one.

There is a relationship between the number of propositions in a passage and
the amount of recall, in which the presence of more propositions in the text
base correlates with higher recall. It is difficult to believe that fewer
propositions simply mean that fewer are available for recall when the totals far
exceed memory limits. Rather, it seems that since the number of words in a
passage is approximately the same and yet the number of propositions differs
(51 for the easy science, 48 for the hard social studies, 46 for the easy social
studies, and 41 for the hard science passages), there is what might be called a
proposition complexity factor. Propositions are composed of arguments and
predicates, with the arguments specifying text
referenced concepts. In the fifth grade passages, the arguments (noun
phrases) are quite basic, usually two or three words at the most, while
arguments in the college passages tend to be much more complex. This
complexity or density of a proposition's constituents may influence a
proposition's memorability. More likely, however, this density is an indicator of
thematic specificity, and the more specific the topic, the less likely the reader
is to have an adequate conceptual base into which to integrate information
learned from the text. This brings us to what seems a key factor in the
differences in recall: prior knowledge. The hard science passage on memory
was unmistakably the most esoteric of the stimulus texts and, it is presumed,
the least familiar to the subjects. The mean difference between this passage
and the others, which were more general in focus and thematically more likely
to be part of one's knowledge about the world, suggests a powerful influence
of the reader's prior knowledge on the integration and rearrangement of the
selected text information.

Immediate vs. Delayed Recall

As mentioned, the students were presented with two easy social studies
passages, counterbalanced and arranged so that one passage occurred in
rotation with the other three passage types and the second always in the fifth
and final reading position. As with the other three passage types, students
were requested to recall the first easy social studies text immediately after
reading it, whereas there was no recall required for the second passage until
the 48-hour delayed recall session. This condition was designed to ascertain
the effects that delaying the students' initial written recall of a passage would
have on subsequent memory for that material.

While there is no statistical difference between the stories, there was a
significant difference between the immediate and delayed conditions for total
recall, F(1,15) = 4.69, p < .05, with total and text specific recall much higher
when students recalled the text immediately than when they delayed their
recall 48 hours. This was the case despite the fact that students spent a
substantially longer amount of time reading the second, unrecalled passage.

1 Statistics reported have been corrected for possible bias due to repeated measures through
use of the Geisser-Greenhouse conservative F test.



Table 2

Means and Standard Deviations for the Immediate vs. Delayed* Initial Recall

                   Total           Text           Reading Time (sec)

Immediate
  Passage 1        11.88 (6.33)    7.00 (3.85)    119.38 ( 57.91)
  Passage 2        13.38 (7.11)    5.50 (3.85)    115.00 ( 24.05)
  Overall          12.63 (6.55)    6.25 (3.80)    117.19 ( 42.89)

Delayed
  Passage 1         5.25 (7.09)    2.88 (4.55)    166.25 (120.85)
  Passage 2         8.00 (5.95)    1.50 (1.77)    163.13 ( 84.00)
  Overall           6.63 (6.48)    2.19 (3.41)    165.00 (104.25)

* for the second session



These recall differences held up for the third recall session as well. This seems
to indicate that the nature of the activity which follows reading plays a
substantial role in the process of assimilating to existing knowledge bases the
information from reading a passage.

Conclusion 

The results of this study suggest that the comprehensibility of a text, as
indexed by the ability of subjects to recall its content, cannot be determined by
readability characteristics alone. It seems, as Kintsch and Vipond (1977, p. 41)
tell us, that "readability is not somehow an inherent property of texts, but is the
result of the interaction between a particular text (with its text characteristics)
and particular readers (with their information-processing characteristics)."
This study also seems to indicate the pressing need for studies which
investigate how the reader's knowledge base affects the comprehension and
learning processes, and that a text recall paradigm is a valuable way to
investigate this question.
A system for scoring readers' recall of propositions from texts

Text information is decoded, recoded, and encoded in the reading/recall
paradigm. During recall sessions, readers generate numerous propositions that
are the results of operations performed both during and after the processing
of texts. Because no evidence exists at the present time to conclude that
verbatim recall is equivalent to the absence of processing/comprehension
during reading, it must be assumed that all recall is the function of operations
performed by readers. All operations that readers perform may be referred to as
inferences.

Evidence of inference that readers generate has been derived from the
coding of elicited data in recall. Kintsch (1977) has devised a prototype
scoring system for coding general recall of readers. Texts and readers' recall
are decomposed into propositions and matched against one another to determine
the ways in which readers delete, reduce, or elaborate original propositions.
Drum (1977) has modified the Kintsch system to five basic categories.
Fredericksen (1976, 1977) has devised a system specifically for scoring
inference, rather than general recall data as Kintsch and Drum had done, by
developing a list of inferences (types and functions) that readers generate,
based on young children's story reading episodes.

The purpose of this study was to generate an efficacious system for scoring
the output of inferences generated by adult readers. In order to create this
system, two separate studies were designed to examine the inferences that
readers make when they are processing texts. The first investigation was an
introspective case study of two adult readers; each was asked to read two
passages and to comment on the text as she/he read. The two adults were
encouraged to articulate their internal observations about the texts and the
mental operations they were performing while reading.* Their observations
served as a basis for the creation of the scoring system that was generated.

The second study was a formal, experimental investigation of readers' recall.
Passages were developed to elicit information under three sets of conditions:

1) congruence/incongruence of propositions within texts

2) wholistic/additive modes of presentation

3) free/cued recall

Method

Sample

Eight college freshmen enrolled in an English composition course participated
in the study.

Preparation of Texts 
A four-sentence passage containing complementary propositions was generated.
The syntactic frame of this passage was used to construct a second
passage that contained contradictory propositions; the concept load was
controlled and matched with the first passage. Two additional passages (one
complementary and one contradictory) using the original frame were also
generated.

*Data from this ecological study have been used as referents for designing a scoring system;
these data will not be presented in this study.

Preparation of Cards for Cued Recall

A series of 12 cards, each containing a single lexical item, was created for
each passage.

Procedure 

Each student was asked to read two passages in two separate presentation
modes: 1) whole passage or 2) additive (sentence by sentence). Each subject
received either two complementary, two contradictory, or one complementary
and one contradictory passage.

After reading each passage, or each sentence in the additive presentation
mode, students were asked to write their recall of the text from memory.
After the additive mode, subjects were asked to write a recall summary.
Subjects were then asked to sort the cards, each containing a single lexical
item, into two categories: recalled/unrecalled. When the cards were sorted,
subjects were asked to reconstruct the passage by using the cards as cues for
recall. The same procedure was repeated as each subject read his/her second
passage.

Scoring System

The recall data from these two studies and from a previous study (Flood &
Lapp, 1977) directed the generation of specific categories within the proposed
scoring system. The system is based on the observed logical operations of
readers. Each category and subcategory was designed from one or more bits of
information recalled by our readers. This system differs from previous systems
because it is based on the output of readers. Categories were generated only
from readers' recall; the system was not generated from presupposed logical
or psychological categories. The system is presented on the following pages.
Each category has several subcategories. The generic category and each of its
subsets, with explanations and exemplars, are presented below.



Complete Scoring System

I. generating text identical information
   identical recall of information written in text
      Text:   Jason was a lawyer.
      Recall: Jason was a lawyer.

II. generating macro/micro structures
   creating larger or smaller units to accommodate text information

   1. synonymy
      a narrowly defined category; traditionally acceptable synonyms. This
      category assumes a high degree of rater reliability. Synonyms can be
      conventionally acceptable, like couch/sofa (cf. Thesaurus), or text
      specific.
      Text:   couch
      Recall: sofa

   2. colloquial (figurative) synonym
      acceptable synonym within a specific context
      Text:   policeman
      Recall: cop

   3. superordinate
      recall of the larger unit to which text element belongs
      Text:   bear
      Recall: animal

   4. subordinate
      recall of small unit of which text element is a part
      Text:   flower
      Recall: daisy

   5. categorization
      generation of larger concept that encompasses several text elements
      Text:   uniforms, drums, batons, marching people
      Recall: parade

III. generating cause
   establishing preceding or succeeding information that can place an event
   within a framework that can be tolerated by the reader

   1. text proactive
      extracting previous information from text that explains events as
      effects of causes
      Text:   Jason was a lawyer. He became a dentist.
      Recall: Jason realized he made a mistake.

   2. text retroactive
      Text:   Jason liked Chicago. Jason moved to Cheyenne where he enjoyed
              his business.
      Recall: Jason moved from Chicago to Cheyenne because he didn't like
              his job in Chicago.

   3. experience proactive
      presumptions about events that preceded and caused the existing event
      Text:   Jason's business was successful.
      Recall: Jason's family gave him a great deal of money.

   4. experience retroactive
      assumptions about events that succeeded the existing event
      Text:   Jason's business was successful.
      Recall: Jason was successful as a lawyer after he sold his business
              and converted his business assets into client contact.

IV. generating dimension
   creating a spatial, temporal, manner framework that can be tolerated by
   the reader

   1. space
      placing an event in space (metric or nonmetric)
      Text:   Jason practiced law.
      Recall: Jason's business was transcontinental, stretching from urban
              to rural.

   2. time
      placing an event in time (metric or nonmetric)
      Text:   Jason studied law.
      Recall: In the autumn, Jason studied.

   3. motion
      recalling movement
      Text:   Jason's business was transcontinental.
      Recall: Jason flew from coast to coast to help his business.

   4. manner
      recalling specifiable characteristics
      Text:   Jason studied.
      Recall: Jason studied assiduously.

V. accommodating referents
   establishing appropriate referents for ambiguous text elements

   1. conjunctive
      joining two elements
      Text:   Jason was an architect. His fellow engineers praised him.
      Recall: Jason was a lawyer and a dentist.

   2. syncretic
      merging diverse elements into a single element
      Text:   Jason was an architect. His fellow engineers praised him.
      Recall: Jason was an architectural engineer.

   3. disjunctive
      recall of one selected element
      Text:   Jason was an architect. His fellow engineers praised him.
      Recall: Jason was an engineer.

   4. episodic
      sequencing events in a temporally fixed, irreversible order
      Text:   Jason was an architect. His fellow engineers praised him.
      Recall: Jason was a lawyer, but stopped being a lawyer. Then he
              became an engineer.

   5. additive
      creating two sources to accommodate diverse information
      Text:   Jason was an architect. His fellow engineers praised him.
      Recall: One Jason was a lawyer. Another Jason was an engineer.

   6. anaphoric
      establishing a pronominal referent
      Text:   They praised.
      Recall: The engineers praised him.

VI. generating case frames
   creating case frameworks for text elements
      Text:   Jason learned law.
      Recall: Jason was taught law by the faculty of Tulane Law School.

VII. generating attributes
   creating modifications for actors, events, places, or dimensions

   1. actors
      Text:   Jason studied law.
      Recall: Mild-mannered Jason, the bookworm, studied law.

   2. events
      attributing qualifications to events
      Text:   Jason led the parade.
      Recall: The parade was the grandest show in Dublin.

   3. places
      adding specificity to places
      Text:   Jason studied in Louisiana.
      Recall: Jason studied in the humidity of the Southern swamps.

   4. dimension
      attributing characteristics to dimension
      Text:   He moved to Chicago.
      Recall: He moved very far from Tulane.

VIII. generating text erroneous information
   making incorrect inferences
      Text:   Jason, once a lawyer, became an accountant.
      Recall: Jason studied architecture, but practiced nothing.

IX. generating text external information
   using established conventions of recall that convey no new information
      Text:   Jason was a lawyer.
      Recall: I don't know why Jason was what he was.



Implications 

The scoring system presented in this study yielded comprehensive information
about readers' recall of nonnarrative passages. However, for this system to be
used with narrative materials, there is a need to supplement it with a second
system that may account for literary conventions that enhance or limit
comprehension, e.g., knowledge of dialogue rules. Perhaps this supplementary
system could be incorporated into a single system that would be useful for
analyzing many different kinds of texts.

What makes reading difficult: the complexity of structures



Reading comprehension appears to be affected by the complexity of the
linguistic structures used in a passage (Goodman, 1967; Ruddell, 1963;
Strickland, 1962; Pearson, 1974-1975). To test this hypothesis, two versions
of a story, "Helen Keller" (which appeared in the Released Exercises of the
National Assessment of Educational Progress (NAEP) Reading Data, 1975), were
used: a Simple Version and a Complex Version. First, three methods were
used to measure the complexity of the stories. Next, Cloze Versions (Simple
Cloze Version/Complex Cloze Version) were designed and administered to check
the complexity as a measure of reading comprehension.

Part I: Applying Three Different Methods to Measure Complexity

In the first part of this study, three different methods were used to attempt
to measure the complexity of the two versions of the "Helen Keller Story".
(a) Four readability formulas were used: Dale-Chall, Flesch, Fry, and Bormuth
Readability. Readability formulas are frequently built on sentence length and a
vocabulary measure. The formulas were used to determine whether differences
in complexity between the Simple Version and the Complex Version would be
reflected in readability scores. (b) Secondly, the mean number of words per
T-unit was calculated. The T-unit has been defined as an independent clause
with its dependent predications (Hunt, 1965). Some researchers have found the
mean number of words per T-unit to be an effective measure of the complexity
of writing (Hunt, 1965; O'Donnell, Griffin, and Norris, 1967; and O'Donnell,
1976). The mean number of words per T-unit increased as writing became
more complex. (c) Thirdly, the Schmidt-Kittell Linguistic Complexity Scale was
applied. When a study of the syntactic complexity of the spontaneous
story-retell responses of 300 students to six different modes of story
presentation was conducted (Schmidt, 1974), this Scale (S-KLCS) was sensitive in
determining the linguistic complexity of oral and written retells. It appeared
to measure within-T-unit and inter-T-unit complexity more sensitively than mean
number of words per T-unit. Thus it was applied to both versions of the
"Helen Keller Story".

The original version of the "Helen Keller Story" as released from NAEP was
used as the Simple Version in this study. Next the researcher manipulated
the syntax of the story to create a Complex Version. The basic vocabulary was
not changed. Prepositional phrases and participial phrases were preposed; noun
structures were altered to use gerund forms; genitives were preposed;
appositives and passive forms were used. For example:

PARTICIPIALS: Succeeding in arousing ... (PREPOSED)     COMPLEX
              Miss S succeeded in arousing ...          SIMPLE

GENITIVES:    Miss S spelled object's names ...         COMPLEX
              Miss S spelled the names of objects ...   SIMPLE

PASSIVES:     Speech lessons were begun ...             COMPLEX
              She began speech lessons ...              SIMPLE



Applying the Readability Formulas 

When the four readability formulas were applied to determine the relative
complexity of each version, the grade-level scores varied from .2 to .5
grade-levels between the Simple and Complex Versions, with the Complex Version
being rated slightly more complex by each scale. The designers of most
readability scales call attention to the fact that the grade level obtained
would be limited to being within approximately a year of the difficulty level
of readers. If this statement is considered, then none of the readability
scales used measured the relative complexity between the two versions of the
story.

Applying the T-unit

The mean number of words per T-unit was computed to see if this might be
an effective measure of the complexity of these two versions of the "Helen
Keller Story". Hunt found that as students mature from grades four to twelve
they use more words per T-unit. The research of O'Donnell, Griffin, and Norris
supported his data (O'Donnell, 1976). In this study, just the opposite effect
appeared. The Simple Version had more words per T-unit (15.64) than did the
Complex Version (13.09). T-unit length was shortened but the passage appeared
to be more complex.
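
As a rough illustration of the measure just described, the sketch below computes mean words per T-unit in Python. It assumes the text has already been segmented into T-units by hand (the segmentation itself is a judgment made by the analyst and is not automated here), and it counts whitespace-separated tokens as words. The function name is hypothetical; the two sample T-units are the ones quoted later in this paper.

```python
def mean_words_per_t_unit(t_units):
    """Return the mean number of whitespace-separated words per T-unit.

    `t_units` is a list of strings, each already segmented by hand into
    one T-unit (an independent clause with its dependent predications).
    """
    if not t_units:
        raise ValueError("at least one T-unit is required")
    counts = [len(unit.split()) for unit in t_units]
    return sum(counts) / len(counts)


# The example T-units quoted in the paper's Overall Conclusions.
simple_t_unit = ("She also wrote many books and articles and included an "
                 "autobiography of her early years.")
complex_t_unit = ("Her writing, many books and articles, included an "
                  "autobiography of early years.")

print(mean_words_per_t_unit([simple_t_unit]))   # 15.0
print(mean_words_per_t_unit([complex_t_unit]))  # 12.0
```

The single-sentence counts agree with the 15 and 12 words reported for these T-units; the passage-level means reported above would require the full segmented passages, which are not reproduced here.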

Applying the Schmidt-Kittell Linguistic Complexity Scale (S-KLCS)

To apply the Schmidt-Kittell Linguistic Complexity Scale, a numerical value
was applied to each manipulation of the syntax. Anaphoric structures,
preposing, postposing, unusual word orders, etc. were weighted. Then a
syntactic complexity ratio was computed by dividing the total number of
operations by the number of T-units. The Mean Ratio Score of the Simple Version
of the "Helen Keller Story" was 3.68 when the Schmidt-Kittell Linguistic
Complexity Scale was applied; the Mean Ratio Score of the Complex Version
was 4.34. These Mean Ratio Scores were significantly different, t(20) = 3.60,
p < .01. The Schmidt-Kittell Linguistic Complexity Scale appeared to be a
sensitive tool to measure the differences in complexity between these two
selections.

Summary of Part I 

There appeared to be no differences between versions when measured by the
four readability scales: the Complex Version was measured as only .2 to .5
grade-levels more complex on any of the readability formulas.

When the mean number of words per T-unit was computed, the mean T-unit
length of the Complex Version, 13.09, was shorter than the mean T-unit
length of the Simple Version, 15.74. This is the opposite of frequently-cited

The Schmidt-Kittell Linguistic Complexity Scale appeared to be a sensitive
measure of the relative complexity between the two versions. To test this
sensitivity, a cloze test of comprehension was set up using the Simple and
the Complex "Helen Keller Stories".

Part II: Administering the Cloze Tests of Comprehension

The cloze technique was applied to both versions of the "Helen Keller Story"
so that they could be used as tests of comprehension. Directions for making
and taking the tests followed Bormuth (Page, 1975). Every fifth word in each
passage was deleted.
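
The deletion step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it deletes every fifth whitespace-separated word counting from the start of the passage, keeps surrounding punctuation in place, and scores a response as correct only when it matches the deleted word exactly (ignoring case). The function names `make_cloze` and `score_cloze` are hypothetical, and the study's actual directions were those of Bormuth as cited.

```python
def make_cloze(text, n=5, blank="_____"):
    """Replace every n-th word with a blank.

    Returns the cloze passage and the list of deleted words, in order.
    Punctuation attached to a deleted word is left in the passage.
    """
    cloze_words, answers = [], []
    for position, word in enumerate(text.split(), start=1):
        core = word.strip(".,;:!?\"'()")
        if position % n == 0 and core:
            answers.append(core)
            cloze_words.append(word.replace(core, blank, 1))
        else:
            cloze_words.append(word)
    return " ".join(cloze_words), answers


def score_cloze(responses, answers):
    """Percentage of blanks filled with exactly the deleted word."""
    exact = sum(r.strip().lower() == a.lower()
                for r, a in zip(responses, answers))
    return 100.0 * exact / len(answers)


passage = ("She also wrote many books and articles and included an "
           "autobiography of her early years.")
cloze_text, deleted = make_cloze(passage)
print(cloze_text)
print(deleted)  # ['books', 'an', 'years']
```

A reader who filled the three blanks with "books", "a", and "years" would score two of three, or about 67 percent, under this exact-match assumption.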

The two versions (Simple Cloze Version/Complex Cloze Version) were then
given to three classes of university students: two classes of senior education
students enrolled in a reading/language arts course and a third class of
graduate students enrolled in a reading/language arts course. A total of 84
students were tested. Students were given the Cloze Tests prior to a lecture
on the use of cloze as a measure of comprehension so that they could
experience the taking of such a test. Either the Simple Cloze Version or the
Complex Cloze Version was distributed to the students randomly. The cloze
scores were computed following Bormuth as cited. Cloze scores for each
university class are given in Table 1.

Table 1

Cloze Scores of University Seniors and Graduates on the Simple
Cloze Version and the Complex Cloze Version of the
"Helen Keller Story"

                                    Version            Difference in
Classes                         Simple    Complex      Cloze Points

Summer 77 Graduates (N = 32)    66.21     48.62        17.59
Autumn 76 Seniors (N = 22)      69.50     57.80        11.70
Spring 76 (N = 31)              70.39     55.23        15.16



While both versions were well within the range of scores which would
indicate that comprehension was not difficult for university students, it is
interesting to note that university students would drop from 11 to 17 cloze
points when given the Complex Version. (Bormuth suggested a score of 44%
right could be compared to instructional level on an informal inventory, while
62% could be compared with an independent level.) A 2x3 analysis of variance
for unequal N's (Winer, 1962) was applied to the data. F(1,79) = 36.27, p < .05,
indicated highly significant differences between the two groups in cloze scores
when comprehension was analyzed on the Simple and Complex Versions. No
differences existed among classes, F(2,79) = 2.28, p > .05.

The Simple Version has been used in the NAEP data with fourth, eighth, and
eleventh grade students. Only about half of the story was used at grade four,
but it appeared to be at a relatively complex level of readability.
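
The comparison of the class means with Bormuth's cut-off scores can be laid out as a small Python sketch. The means are those in Table 1 and the 44% and 62% thresholds are the ones quoted above; the function name, the dictionary layout, and the label "below instructional" for scores under 44% are assumptions made for illustration only.

```python
INSTRUCTIONAL_MIN = 44.0  # Bormuth threshold quoted in the text
INDEPENDENT_MIN = 62.0    # Bormuth threshold quoted in the text


def cloze_level(score):
    """Map a cloze percentage onto the levels named in the text."""
    if score >= INDEPENDENT_MIN:
        return "independent"
    if score >= INSTRUCTIONAL_MIN:
        return "instructional"
    return "below instructional"  # label assumed, not from the source


# (Simple Version mean, Complex Version mean) from Table 1.
table_1 = {
    "Summer 77 Graduates": (66.21, 48.62),
    "Autumn 76 Seniors": (69.50, 57.80),
    "Spring 76": (70.39, 55.23),
}

summary = {
    label: {
        "difference": round(simple - complex_, 2),
        "simple_level": cloze_level(simple),
        "complex_level": cloze_level(complex_),
    }
    for label, (simple, complex_) in table_1.items()
}

for label, row in summary.items():
    print(label, row)
```

On these thresholds every class mean falls at the independent level on the Simple Version and at the instructional level on the Complex Version, which is consistent with the 11 to 17 point drop noted above.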

Overall Conclusions

The results indicated that (a) the readability scales used in this study did not
appear to measure the relative complexity between the two versions of the story,
(b) mean number of words per T-unit did not measure the differences in
complexity between the versions either, and (c) the Schmidt-Kittell Linguistic
Complexity Scale appeared to be a sensitive measure of the linguistic
complexity. This was supported by the performance of university students on the
cloze comprehension tests; they demonstrated difficulty in comprehension as
measured by cloze scores on the Complex Version.

The researcher hypothesized that reading became more difficult because the
reader processing language could not make sense out of the story; the reader
could not comprehend it (Smith, 1975; Goodman, 1975). Pearson (1974-75)
called for studies in which different versions of passages are constructed
according to some rule-governed procedures with questions asked relevant to
the structural changes effected. In this study, for one thing, the syntax
was not as predictable in the Complex Version. Second, the language was less
redundant (Smith, 1975). Third, more deletions occurred; therefore, when the
reader was processing the language there were fewer cues to enable the reader
to get at the meaning. Fourth, unusual grammatical structures were used. An
example of a T-unit may help to clarify these four points.

Simple Version

"She also wrote many books and articles and included an
2 4 3 5 32 3 3 213 1

autobiography of her early years." 15 words, S-KLCS Ratio = 3.53
6 3 3 5 3

Complex Version

"Her writing, many books and articles, included an
3 5 12 5 3 23 311. 3.1

autobiography of early years." 12 words, S-KLCS Ratio = 4.25
6 3 1 5 3

The underlined words were those deleted in the Cloze Tests. The numbers
indicate the numerical weightings of the S-KLCS.

In the Complex Version, the common NV pattern was replaced with a
possessive personal pronoun/gerund phrase, appositive phrase, verb. Elements
of a relative clause "which were" were deleted to make the appositive
phrase. The conjunctive was deleted; a comma was inserted. Other deletions
included "also" and the possessive personal pronoun "her" which had
identified whose "early years" were being discussed. The reader must process
the gerund "writing" acting as the subject of the verb "included" instead of
the usual NV construction "she ... included". The deceivingly small changes
in syntax and the use of only three fewer words appeared to make this
T-unit much more complex to process.
Psycholinguistic analysis of how the reader may be processing complex
linguistic structures is of real interest. The complexity of our language makes
the complex process of reading even more complex.

Immediate memory for sentences of fast and slower readers as a function of rate of presentation

In speculations as to how mature and rapid readers read, one of the main
issues is how meaning is derived from the surface representation of sentences.
Huey (1968) and Smith (1971) submitted that meanings were suggested
immediately by the visual forms. Athey (1971) thought that word identification
processes are bypassed or telescoped in smooth reading. One step towards
the understanding of such processes would be through comparative studies
of fast and slower readers on what is remembered of both the meaning and
surface representation of sentences. This study investigated differences between
fast and slower readers in immediate memory for different aspects of
sentences in connected discourse as a function of rate of presentation. The
design of the study focused on the interactions between the variables: 1) type of
reader, 2) rate of presentation, and 3) type of test probe (meaning or non-meaning).

Studies by Huey (1968), Gates (1922), and Gilbert (1959) indicated that
rapid readers are superior to slower readers in their memory for literal content. 
Bower (1970) found that at a fast rate of presentation of single sentences, 
slower readers detected changes in meaning as accurately as rapid readers. 
These studies, however, either did not use connected discourse or did not 
control the rate at which the materials were read. 

Memory for both meaning and the surface representation of sentences was
studied by means of a recognition paradigm developed by Sachs (1967). In
studies using similar paradigms, the relationship between immediate memory
for meaning and surface features of sentences differed depending on the
experimental circumstances. Meaning scores were higher than (Pezdek & Roger,
1974), equal to (Sachs, 1967), or lower than (Tyler, 1971) scores measuring
surface representation. The variance in results indicated that such a paradigm
was serviceable in measuring differences in memory for both meaning and
surface representation.

Method



The study employed an ANOVA design with two levels each of type of
reader (fast and slow), order of rate of presentation (eight fast passages
followed by eight slow passages or eight slow passages followed by eight fast
passages), amount of intervening material (0 syllables or 80 syllables), and
rate of presentation (160 words per minute or 480 words per minute). These
combined with four types of test probes (semantic, syntactic, lexical,
identical) formed 64 possible combinations. Each subject was assigned to one
of these combinations.
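
The count of 64 follows directly from crossing the factors listed above (2 x 2 x 2 x 2 x 4). A minimal Python sketch of that enumeration is given below; the factor names and level labels are chosen here for illustration and are not the authors' own coding scheme.

```python
from itertools import product

# Factor levels as described in the text; the key names are illustrative.
factors = {
    "reader": ["fast", "slow"],
    "order": ["fast-then-slow", "slow-then-fast"],
    "intervening_syllables": [0, 80],
    "rate_wpm": [160, 480],
    "probe": ["semantic", "syntactic", "lexical", "identical"],
}

# Every combination of one level per factor.
cells = [dict(zip(factors, combo)) for combo in product(*factors.values())]

print(len(cells))  # 64
print(cells[0])
```

Half of the enumerated cells involve fast readers and half slow readers, which matches the 32 fast and 32 slower readers described under Subjects.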

The 16 experimental passages were counterbalanced so that a version of
each passage was given to each subject. Scores for the delayed condition
were not included in the statistical analyses as these passages served merely
as filler passages to discourage the readers from assuming that the last sentence
of the passage was always tested. As fast and slow presentations were in
blocks, the order of rate of presentation was treated as a control variable
with passages within each rate condition being randomized. The orders of
presentation of rate were randomly assigned with both levels being represented
an equal number of times.

Subjects

The 64 high school sophomores who served as subjects included 32 fast
readers and 32 slower readers selected on the basis of Nelson-Denny Reading
Test scores administered in their freshman year. The students were matched
for reading comprehension. Pre-experimental analysis showed no significant
differences in comprehension or vocabulary between the groups, but a significant
difference in rate, t(31) = 2.67, p < .01. The fast group read at an average rate
of 468 words per minute while the slower group read at the average rate
of 246 words per minute.

Materials

The 16 passages of connected discourse used were developed by Sachs
(1974) for a study on memory loss of semantic and surface features of sentences.
The short form of these passages tested immediate memory whereas a form
longer by 80 syllables tested delayed memory. The readability level of the
passages ranged from third to eleventh grade according to Fry's readability
formula (Fry, 1968) with the average at the seventh grade level. The passages
ranged from 35 words to 214 words with an average length of 117 words.

Each passage contained a "base sentence" with that sentence presented
again, changed or unchanged, as the test sentence. The test sentence, the onset
of which was signalled by an asterisk, occurred immediately after the base
sentence or after a delay of 80 syllables. The test sentence tested the
subject's recognition memory of the base sentence, and involved a semantic,
syntactic, or lexical change from the base sentence or was identical to the
base sentence.

For example, if the base sentence was "The legend is that Martin once made
a cloak for a poor man," the test sentences (involving respectively semantic,
syntactic, lexical, and no change) would be:

"The legend is that a poor man once made a cloak for Martin."

"The legend is that Martin once made a poor man a cloak."

"The story is that Martin once made a cloak for a poor man."

"The legend is that Martin once made a cloak for a poor man."

Procedure 

Students were tested individually. They were given practice passages and sets
of practice sentences to assure that they would know what semantic, syntactic,
and lexical changes were. Both the practice passages and experimental
passages, interlaced with answer sheets, were placed before the subject,
with the top passage covered by a cardboard. Two practice passages were first
presented, one at 160 words per minute and the other at 480 words per minute,
in random order. The pacing was done with a slotted cardboard with an
opening large enough to expose one line at a time. Passages were structured
to average eight words per line so that hand-pacing at the two rates by the
experimenter was possible.

The 16 experimental passages were presented in the order and rates
designated by the design of the study. After the test sentence was read, the
passage was removed by the experimenter and the subject marked the
answer sheet that was uncovered. The subject first indicated whether he or she
thought the sentence was changed or identical and rated his or her confidence in
the response on a scale from 1 to 5. This yielded a change detection score.
If changed, the subject indicated the type of change: semantic, syntactic,
lexical, or don't know. This yielded an identification score. Thus there were
two sets of scores: change detection scores and identification scores.

Analysis 

The statistical analysis consisted of separate four-way analyses of variance on
the change detection scores and on the identification scores. The between
group variables were type of reader and order of rate of presentation. The
repeated measures were rate of presentation and probe type. Significant main or
interaction effects were analyzed, when appropriate, by t tests on simple main
effects or by the Newman-Keuls procedure for tests of significance of differences
between pairs of ordered means.

Results 

Change Detection Scores 

No interaction effects were found. The results for the main variables were
as follows:

1. No significant difference was found between fast and slower readers in
their ability to detect change.

2. The mean score for the slow-fast order was significantly higher than that
for the fast-slow order, F(1,60) = 4.90, p < .05.

3. The mean score for the slow rate was significantly higher than that for the
fast rate, F(1,60) = 10.61, p < .01.

4. The main effect for probe type was significant, F(3,180) = 2.65, p < .05.
The semantic probe was significantly higher than the syntactic, lexical, and
identical probes at the .05 level. The syntactic probe was significantly higher
than the lexical and identical probes at the .05 level.

Identification Scores 

The results for the main variables and significant interactions were as follows:

1. Fast readers scored significantly higher than slower readers, F(1,60) = 6.53,
p < .05.

2. No significant difference was found between the slow-fast and the fast-slow
orders of presentation.

3. The mean score for the slow rate of presentation was significantly
higher than the fast rate, F(1,60) = 16.80, p < .01.

4. The main effect for probe type was significant, F(3,180) = 6.05, p < .05.
The identical probe was significantly higher than the other probes at the .05
level. (As the chances for correctly identifying an identical probe (50%) were
significantly higher than the chances for correctly identifying the other probes,
identification scores for the identical probe cannot be compared meaningfully
with scores for the semantic, syntactic, or lexical probes.)

5. The interaction between type of reader by rate of presentation was
significant, F(1,60) = 6.04, p < .05. Whereas there was no significant difference
between fast and slower readers at a slow rate, at the fast rate, fast readers
outperformed slower readers, t(60) = 3.15, p < .05. Also, there was no
significant difference between the slow and fast rates of presentation for the
fast readers, but there was a significant decline in scores for the slower
readers, t(60) = 4.66, p < .05.

6. The interaction between order of rate of presentation by rate of presentation
by probe type was significant, F(3,180) = 3.31, p < .05. The figures drawn
depicting this interaction disclosed that the semantic probe at the fast rate at
the slow-fast order behaved differently than the other probes: the score for
the fast rate was higher than for the slow rate.

7. The interaction between type of reader by order of rate of presentation
by rate of presentation by probe type was significant, F(3,180) = 2.66, p < .05.
An inspection of the figures drawn for this interaction followed by statistical
analyses of lower order interactions disclosed that at the slow-fast order at the
fast rate, the semantic probe score for fast readers increased significantly
over the score at the slow rate, t(30) = 2.58, p < .05, whereas for the slower
reader at the slow-fast order at the fast rate, the identical probe score increased
significantly over the score at the slow rate, t(30) = 2.08, p < .05.

Discussion 

The discussion focuses on the two main findings related to differences
between fast and slower readers. The first was that slower readers can detect
changes as readily as faster readers at both fast and slow rates of reading, but
faster readers outperformed them in identifying changes at the fast rate. This
suggests that the detection of change and the identification of change are
somewhat different processes with the latter being more susceptible to disruption
by excessive speed. The ability to detect change may involve a less
differentiated memory of sentences while the ability to identify changes may
involve a more accurate memory of particular elements in sentences. Thus, the
superior ability of fast readers in processing reading material at fast rates
may depend less on an undifferentiated memory of sentences than on a more
accurate memory of their particular elements.

The second finding was that with a practice or warm-up period in reading,
faster readers' memory for the meaning of sentences improved, whereas in
similar circumstances, slower readers' memory for the surface representation
of sentences improved. This accidental finding is startling in that, in both cases,
the experimental groups performed better at the fast rate than at the slow rate
but with respect to memory for different aspects of sentences. It suggests
that readers' typical strategies and styles may not become immediately apparent
but require a warm-up before they take effect, thus tending to validate warm-up
practices in speed-reading courses before other exercises are undertaken. It
also raises questions about the validity of speed tests that do not provide
a warm-up. These results indicate that there may be some basis for the
speculations that fast and slow readers differ in important ways in their
processing of meaning and the surface representation of sentences.

Investigating the "print to meaning" hypothesis



One of the current "big questions" in the study of reading is what process
readers go through as they read silently. Many reading researchers and
theorists argue as Carver (1977-1978) does that "during rauding, most people
say the words to themselves; that is, they internally articulate each successive
word in a sentence" (p. 15). Others deny the existence of an intermediary
speech step in fluent silent reading. Goodman (1970/1976) states that "when
silent reading becomes proficient, it becomes a very different process from
oral reading. It is much more rapid and not tied to encoding what is being read
as speech" (p. 482). Smith (1975) states that "...reading cannot be
considered a processing of 'decoding' written symbols into speech; it is
neither necessary nor possible for writing to be comprehended in this way.
Instead written language must be directly attacked for meaning . . ." (p. 184).
Goodman and Smith are not the only theoreticians who argue that reading is a
print to meaning process. LaBerge and Samuels (1974/1976), while not stating
that silent reading must proceed directly from print to meaning, do suggest
that "we should note the possibility in the model that a visual word code may
be associated directly with a semantic meaning code. That is a unit . . . may
activate its meaning . . . without mediation through the phonological system"
(p. 564). Rozin and Gleitman (1977) present a substantial amount of evidence
and argument on both sides of the "print to meaning" question.

The purpose of the investigation reported here was to further empirically
investigate the "print to meaning" question. In order to accomplish this
purpose, the investigators designed a reading task in which using sound as an
intermediary between print and meaning was most unlikely. One group of
fluent readers was randomly assigned to this task. Another group was
assigned to a task differing only in the fact that using sound was a possibility.
The two groups were then compared on their trials to criterion in learning the
passage's special lexicon, rate of reading, and recall of material read. It was
reasoned that if fluent readers proceed directly from print to meaning, then
removing the probability of a sound intermediary would have no effect on rate
or recall.

Mettpd 

To investigate the hypothesis that fluent reading proceeds directly from 
print to meaning, a task was constructed which should preclude the reader's 
use of sound as an intermediary. Six pictures of fish were selected and names 
were made up for these fish. The names (doffit, dulmet, mintex, mastib, pontud 
and pemtad) were all six-letter, two-syllable, pronounceable words. 
(Pronounceability was checked by asking three adult readers to say the words. 
Their pronunciations were immediate and consistent.) Two words were 
constructed for each first letter so that subjects could not discriminate on the 
basis of first letter only. From each of the pronounceable names, an 
unpronounceable name was constructed by changing the positions of the 
second and third letters and the fifth and sixth letters. The unpronounceable 
words (dfofti, dlumte, mnitxe, msatbi, pnotdu and pmetda) were verified by 
three adults as being unpronounceable. 
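
The transposition rule just described can be sketched in a few lines of Python. This is only an illustration of the stated rule (swap the second and third letters, then the fifth and sixth); the function name `scramble_name` is hypothetical and not part of the original study.

```python
def scramble_name(name: str) -> str:
    """Swap letters 2<->3 and 5<->6 (1-based positions) of a six-letter name."""
    if len(name) != 6:
        raise ValueError("expected a six-letter name")
    letters = list(name)
    letters[1], letters[2] = letters[2], letters[1]  # second and third letters
    letters[4], letters[5] = letters[5], letters[4]  # fifth and sixth letters
    return "".join(letters)


# The six pronounceable names listed in the text.
PRONOUNCEABLE = ["doffit", "dulmet", "mintex", "mastib", "pontud", "pemtad"]
UNPRONOUNCEABLE = [scramble_name(n) for n in PRONOUNCEABLE]

print(UNPRONOUNCEABLE)
# ['dfofti', 'dlumte', 'mnitxe', 'msatbi', 'pnotdu', 'pmetda']
```

Applying the rule to the six pronounceable names reproduces the six unpronounceable names given above, and each scrambled name keeps the first letter of its source name.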

Next a passage was written about these fish. The passage of approximately 
450 words described imaginary properties of the six fish: 

There are six fish which could easily be called the "Six Wonders of the Sea 
World." These wonderous fish are the mintex, the doffit, the pontud, the 
dulmet, the pemtad, and the mastib. 

The mintex, for example, is the fastest fish in the world. Sometimes it has 
been timed swimming as fast as 40 miles per hour. In a race with a mintex, 
any other fish would lose. Mintexes race each other sometimes as if they 
[...] 

The doffit's claim to fame is just the opposite of the mintex. The doffit is the 
slowest fish in the world. It has been known to remain in the same place in the 
water for hours. The doffit doesn't have to worry, though, about not being able 
to run away from its enemy. The doffit tastes so bad that no other fish will try 
to eat it. 

The pontud is a famous fish because of the way its body lights up when it 
falls in love with another pontud. Scientists have tried to figure out how to 
make flashlights which work on the same chemical that the pontud has but 
they have not succeeded. It is a beautiful sight to see two pontuds swimming 
along blinking at each other! 

The dulmet is not an unusual fish because it is fast or slow or lights up, but 
because of how delicious it is to eat. Dulmet meat sells for $25 per pound in 
the stores where you can find it. Kings and Queens have long considered 
dulmet a royal dish. Dulmet is so good that some people eat it raw! 

The pemtad is the world's most dangerous fish because it has a poisonous 
bite. Pemtad poison is more deadly than rattlesnake venom and there is no 
antidote for it. Pemtad fishing is against the law in some countries because of 
[...] 

[...] there is the mastib fish. The mastib fish is so unusual because there 
is nothing it would rather do than be caught by a fisherman. No bait is 
necessary to capture a mastib. They run straight toward a hook or net. 
Needless to say, there aren't many mastibs left! 

So whether it's the speedy mintex, the slowpoke doffit, the flashy pontud, 
the delicious dulmet, the poisonous pemtad or the easy-to-catch mastib, you'll 
have to admit that here are some pretty fishy fish! 

Another passage was constructed by substituting the unpronounceable fish 
names for the pronounceable names. Thus the first paragraph of the 
unpronounceable fish passage read: "There are six fish which could easily be 
called the 'Six Wonders of the Sea World.' These wonderous fish are the 
mnitxe, the dfofti, the pnotdu, the dlumte, the pmetda, and the msatbi." 

The study was carried out with a group of 47 fifth and sixth grade students 
who read at or above grade level and with a group of 14 graduate students in 
Reading Education. Fifth and sixth grade students were used because they 
were thought to be the youngest good readers who were well beyond any 
decoding stage which uses serial processing of words. A stratified random 
sample was achieved by randomly assigning the elementary and graduate 
students to one of two groups. One group learned the pronounceable words and 
read the pronounceable fish passage; the other group learned the 
unpronounceable words and read the unpronounceable fish passage. 

Unfortunately, because there is a basic confounding of a word's 
pronounceability and its orthographic legality (Gibson & Levin, 1975), the two 
passages differed not only on the wanted variable of pronounceability versus 
unpronounceability but also on the unwanted variable of orthographic legality 
versus illegality. To diminish the influence of the unwanted variable without 
rendering the groups unequivalent, a treatment was designed which would 
teach each subject to associate the picture of each fish with the name as a 
chunk or as a sight word before the subject was asked to read the passage. 

The training procedure for both groups was identical. Subjects were 
individually taught to associate the pictures of the fish with their names. In 
neither group were the names pronounced. Rather the fish were spread out, 
the appropriate name was placed next to each fish and subjects studied the 
names and the fish until they thought they could match them. Subjects 
continued the matching trials to criterion until they had successfully matched 
the fish and their names twice. Ten trials was set as the maximum number 
before excluding a subject from the experiment. Six of the subjects in the 
unpronounceable group were unable to successfully match the fish and the 
names after ten trials. Three subjects in the pronounceable group were unable 
to match them. In order to be sure that the two groups who read the passage 
were equivalent, three subjects in the pronounceable group who had had the 
greatest number of trials were excluded from the analysis. 
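
As a rough illustration of the trials-to-criterion rule described above, the sketch below counts trials until a subject has matched all six names correctly twice, and returns `None` when that does not happen within ten trials. It assumes the two successful matches need not be consecutive, which the text does not specify; the function name and the input format (one boolean per trial) are hypothetical.

```python
from typing import Optional, Sequence

MAX_TRIALS = 10          # maximum number of trials stated in the text
REQUIRED_SUCCESSES = 2   # names and fish matched successfully twice


def trials_to_criterion(outcomes: Sequence[bool]) -> Optional[int]:
    """Return the trial on which criterion was reached, or None if never reached.

    `outcomes[i]` is True when every fish was matched to its name on trial i+1.
    Assumption: the two successes may be non-consecutive.
    """
    successes = 0
    for trial, matched_all in enumerate(outcomes[:MAX_TRIALS], start=1):
        if matched_all:
            successes += 1
            if successes == REQUIRED_SUCCESSES:
                return trial
    return None  # subject would be excluded from the experiment


print(trials_to_criterion([False, True, False, True]))  # 4
print(trials_to_criterion([False] * 10))                # None
```

A subject with a `None` result corresponds to one of the subjects who could not match the fish and the names after ten trials.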

When the subjects had successfully matched the fish and the names twice, 
they were given the appropriate fish passage to read. Subjects were told to 
read the passage so that after reading they could point to the fish that had 
particular traits. Subjects were timed as they read the passage and asked to 
read at their "normal speed." 

Upon completion of the passage, the pictures of the fish were spread out 
and subjects were asked to point to the fish which had a particular 
characteristic. They were given no feedback as to the correctness or 
incorrectness of their responses. Next the names of the fish were spread out 
and subjects were asked to perform the same task with the names rather than 
the pictures. Finally, subjects were shown each name and asked, "What is 
this?" Their responses to this question were classified as semantic ("a fish" 
or "the fish whose body lights up when it falls in love" or "that big striped fish" 
for example) or acoustic (a pronunciation of any kind for the fish). 

Results 

The difference between the mean trials to criterion for the pronounceable 
group (4.75) and the unpronounceable group (5.56) was analyzed by a one-tailed 
t test for independent samples. This difference approached but did not 
reach conventional levels of significance (p < .07). 

The differences between groups on the time required to read the passage 
and the two comprehension variables were measured by a two-way 
multivariate analysis of variance with pronounceable versus unpronounceable 
as one factor and age as the second factor. The multivariate F for the 
pronounceable versus unpronounceable factor was significant. To illuminate 
the reasons for this significance, univariate F's for the three dependent 
variables were examined. There were significant differences between the 
pronounceable and the unpronounceable groups on the amount of time 
required to read the passage, F (3,43) = 6.51, p < .05, and on the name 
comprehension variable, F (3,43) = 5.65, p < .05 (see Table 1). Although the 
picture comprehension results favored the pronounceable group, this 
difference was not significant. 

For all analyses, there were significant differences as expected between 
the graduate students and the elementary students. Graduate students 
learned the names of the fish in significantly fewer trials, read the passages in 



Table 1 

Cell Means for Three Dependent Variables across Two Factors 

                       Graduate Students                 Elementary Students 
                  Time in   Compre-   Compre-       Time in   Compre-   Compre- 
                  Seconds   hension   hension       Seconds   hension   hension 
                            Picture   Name                    Picture   Name 

Pronounceable      115.4      5.1       4.3          134.1      1.9       2.6 
Unpronounceable    136.7      2.9       2.7          159.3      1.9       1.7 



significantly less time and performed significantly better on both the picture 
and the name recall task. There was not a significant multivariate interaction 
effect. 

The responses to the "What is it?" question for the elementary students 
were clearly affected by which names (pronounceable or unpronounceable) 
they learned. For the students in the pronounceable group, 13 consistently 
gave an acoustic response to the "What is it?" question. Only four students 
consistently gave a semantic response. For the unpronounceable group only 
six students consistently gave an acoustic response, 12 gave a semantic 
response. For the graduate students these results were more equivocal. Two 
of the seven students in the pronounceable group gave an acoustic response; 
three of the seven students in the unpronounceable group gave an acoustic 
response. This difference may be attributed to the way in which the question 
was put to the graduate students. The elementary students responded 
unhesitatingly to the "What is it?" question. The graduate students, however, 
were unwilling to make a response without further clarification. To clarify and 
to provoke a response, the investigator followed up the "What is it?" question 
with a "How do you have it stored?" question. Consequently, the two groups 
were in reality responding to two different questions. 
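
To make the elementary students' response pattern easier to compare across groups, the counts reported above can be turned into simple proportions. The sketch below uses only the counts given in the text (students who responded consistently); the dictionary layout and function name are illustrative assumptions, and no significance test is implied.

```python
# Counts of elementary students who responded consistently, as reported in the text.
ELEMENTARY_COUNTS = {
    "pronounceable": {"acoustic": 13, "semantic": 4},
    "unpronounceable": {"acoustic": 6, "semantic": 12},
}


def acoustic_share(group: str) -> float:
    """Proportion of consistent responders in `group` who gave acoustic responses."""
    counts = ELEMENTARY_COUNTS[group]
    return counts["acoustic"] / (counts["acoustic"] + counts["semantic"])


for group in ELEMENTARY_COUNTS:
    print(f"{group}: {acoustic_share(group):.2f}")
# pronounceable: 0.76
# unpronounceable: 0.33
```

Roughly three quarters of the consistent responders in the pronounceable group answered acoustically, against about one third in the unpronounceable group.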

Discussion 

The rationale for this investigation stated at the beginning of this paper was 
that if readers proceed directly from print to meaning without a sound 
intermediary, removing the probability of a sound intermediary would not 
result in significantly different rate and recall for the subjects who lacked this 
intermediary. There is evidence from the differential responses of the 
elementary students in the pronounceable and unpronounceable groups to the 
"What is it?" question that the two groups did indeed learn the pronounceable 
and unpronounceable stimuli differently. While the number of trials to criterion 
was greater for the unpronounceable group, this difference was not 
significant. 

The results of this investigation do not support the hypothesis that readers 
proceed directly from print to meaning. Subjects in the pronounceable group 
read the passage in significantly less time than did subjects in the 
unpronounceable group. The magnitude of this difference is appreciated when 
one discovers that the elementary students in the pronounceable group took 
less time on the average to read the passage than did the graduate students in 
the unpronounceable group. Subjects in the pronounceable group also 
achieved significantly greater recall as measured by the name recall task. 

However, there is some observational data from this study to support the 
print to meaning hypothesis. Subjects in both the pronounceable and 
unpronounceable groups appeared to enjoy the story as they were reading it. 
They chuckled occasionally and later asked if "there were really fish like 
that." In addition, the graduate students in the unpronounceable group 
seemed genuinely shocked when they were unable to perform well on the 
recall tasks. These observations lead us to hypothesize a third alternative to 
the "print to meaning"/"print to sound to meaning" controversy. 

Readers may indeed comprehend directly from print to meaning. However, 
because short term memory appears to require auditory rehearsal for 
chunking into long term memory, recoding from meaning into sound may be 
necessary for recall of what was comprehended. In short, fluent readers 
proceed from "print to meaning to sound to memory." (We think!) 

A partial validation of the kernel distance theory 
for readability 

Most simple readability formulas take two main variables into account: 

1. Vocabulary difficulty as determined by syllables or unfamiliar (low 
   frequency) words; and 
2. Syntactic complexity as determined by sentence length. 

Yet some experimentation and theory give indications that it may be possible 
that two sentences of identical length and identical vocabulary can have different 
difficulty or readability levels. For example, sentence A may appear more 
difficult than sentence B: 

A. The fisherman, after climbing down the hill, was surprised to see the small 
   dirty boat. 

B. The fisherman was surprised to see the small dirty boat after climbing 
   down the hill. 

Hence, by taking more or other syntactic variables into account it might be 
possible to develop a more accurate readability formula, at least for research 
purposes, if not for practical application. Another possibility for the use of 
such information is instructions to writers who are trying to simplify their 
writing. 

Studies such as those done by Coleman (1964; 1965) indicate that 
"embedded" sentences such as Example A, where the subject and verb are 
split by "distance" in the form of a phrase, are harder to read than non-embedded 
sentences. A linguistic theory proposed by Yngve (1968) suggested that 
"distance" (a word or phrase) coming before the subject and verb would 
make the sentence more difficult than the same distance coming after the 
subject and verb. Using this information and a small-scale pilot study, the 
senior author proposed the Kernel Distance Theory (Fry, 1975) which stated: 

"that the nearer the kernel is to the beginning of the sentence, the 
easier the sentence, and the less distance between elements of the kernel, 
the easier the sentence. For the purposes of this theory, we are calling 
the kernel the subject element, the verb element, and when present, the 
object element. . . There is a minor and further refinement of the Kernel 
Distance Theory which states that the distance between the verb and object 
causes less difficulty than the distance between the subject and verb. 
Distance can be defined as number of words, though in practice it is 
often the embedding of a phrase or clause." 

It might be noted that the term "distance" is somewhat unique to this 
study and is defined as a word or phrase. The term "kernel" refers to the 
subject, verb, and sometimes object of a sentence and may not carry the same 
interpretation as used by Chomsky (1957; 1965). 
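
The notion of "distance" can be made concrete with a small sketch that counts words in four positions relative to the kernel: before the subject, between subject and verb, between verb and object, and after the kernel. The example assumes the subject, verb, and object spans have already been marked by hand and appear in that order; the function name, the span format, and the example sentences are hypothetical and are not taken from the studies reported here.

```python
from typing import Dict, Optional, Tuple

Span = Tuple[int, int]  # (start, end) word indices, end exclusive


def kernel_distances(n_words: int, subject: Span, verb: Span,
                     obj: Optional[Span] = None) -> Dict[str, int]:
    """Count words before, inside, and after the kernel of one sentence.

    Assumes subject, verb, and (optional) object occur in that order.
    """
    last_kernel_end = obj[1] if obj is not None else verb[1]
    return {
        "before_kernel": subject[0],
        "subject_to_verb": verb[0] - subject[1],
        "verb_to_object": (obj[0] - verb[1]) if obj is not None else 0,
        "after_kernel": n_words - last_kernel_end,
    }


# Hypothetical sentence with the phrase splitting subject and verb.
split = "The dog after a long nap chased the cat".split()
print(kernel_distances(len(split), subject=(0, 2), verb=(6, 7), obj=(7, 9)))
# {'before_kernel': 0, 'subject_to_verb': 4, 'verb_to_object': 0, 'after_kernel': 0}

# Same words with the phrase moved in front of the kernel.
fronted = "After a long nap the dog chased the cat".split()
print(kernel_distances(len(fronted), subject=(4, 6), verb=(6, 7), obj=(7, 9)))
# {'before_kernel': 4, 'subject_to_verb': 0, 'verb_to_object': 0, 'after_kernel': 0}
```

The two example sentences contain the same words, so only the position of the four-word phrase differs, which is the kind of contrast the sentence pairs in the studies were built around.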

Method and Results of Two Studies 

This paper will report the results of two different investigators' attempts to verify the 
Theory on four different populations using four different measures as dependent 
variables. 



First Study 

Method. DePierro (1976) developed 16 pairs of sentences, four for each part of 
the theory (see Table 1) and asked 30 college students and 30 elementary 
pupils to individually read them. He measured: 

1. Ability to recall the exact words in the sentence; 

2. The time it took to read silently the complete sentence; 

3. The delay between the end of the silent reading of the sentence and 
the onset of oral recall. 

Sentence pairs were written so that each pair contained the same words and 
only syntax was altered according to hypotheses. A balanced rotated design was 
used to cancel out order effects. 

The elementary pupils had reading abilities between 5.0 and 7.0; there 
were no remedial readers. The 14 male and 16 female college subjects 
scored between the 7th and 94th percentiles on the Nelson Denny Reading 
Test, and hence represented the full range of students found in college 
settings. The test of significance was the Wilcoxon T. 

Results. DePierro found that splitting the subject and verb (D2) often 
significantly increased sentence difficulty. See results for Ho 2 and Ho 3 in Table 1. 
However, he found no support for a difference between the subject-verb 
split and the verb-object split. With the college population only he found a 
difference contrary to the theory that distance after the kernel was more 
difficult than distance before. 

Second Study 


Method. In the second study, Weber (1977) using good and poor readers in a 
junior college, attempted to replicate DePierro's findings using a different 
dependent variable. She used student judgment of sentence pairs as to which of 
the two sentences was more difficult. A group test was constructed, using 
DePierro's 16 pairs of sentences and another 16 pairs of sentences of her own 
to make the group test more reliable by increasing its length. 

The 52 poor readers were students who scored below 43 on the Comparative 
Guidance and Placement Program battery of tests developed by Educational 
Testing Service and were placed in three Basic Composition classes. The 53 
good readers scored above 43 on the same test and were placed in three 
Language and Literature classes. The test of significance was the Chi Square. 

Results. Weber basically confirmed DePierro's findings; distance between the 
subject and verb was judged more difficult than the same distance either before 
or after the kernel. No support was found for a difference between distance 
before or after the kernel. Nor was there a difference between the distance between 
the subject and verb and the distance between the verb and the object (see 
Table 1). 



The Kernel Distance Theory, as tested by these studies, can now be 
simplified to six words, DON'T SPLIT THE SUBJECT AND VERB. Stated a bit 
more technically, distance between the subject and verb significantly increases 
the difficulty of a sentence over one with the same amount of distance 
occurring either before or after the kernel. These differences showed up on 

Table 1 

Summary of Agreement With Hypotheses of the Kernel Distance Theory 

                       Weber (a)                     DePierro (b) 
                       32 Sentence Pairs             16 Sentence Pairs 
                       College                       Elementary       College 
Fry Hypothesis         Poor     Good     Total       N = 30           N = 30 
                       N = 52   N = 53   N = 105 

Ho 1   D1 > D4         +                             Words  -         - .01 
                                                     Time   +         - .05 
                                                     Delay  -         + 

Ho 2   D2 >            +        +        + .05       Words  + .05     + .05 
                                                     Time   + .01     + 
                                                     Delay  + 

Ho 3   D2 > D1         + .05    + .05    + .05       Words  + .05     + .05 
                                                     Time   + .01     + 
                                                     Delay  + 

Ho 4   D2 > D3         +                             Words  - 
                                                     Time   + 
                                                     Delay  - 

(a) Plus sign (+) means agreement with hypothesis using judgment, sig. where 
indicated by Chi Square, p value. 

(b) Plus sign (+) means agreement with hypothesis on three measures, sig. 
where indicated by Wilcoxon T, p value. 

          Subject        Verb        Object 
    D1             D2           D3            D4 



four different measures of difficulty with four different populations (though not 
entirely consistently). 

This finding is in agreement with the embedded sentence study of Coleman 
(1964) who had college students memorize pairs of sentences from a memory 
drum. It also agrees with the Hamilton and Deese (1971) listening comprehension 
study of center-embedded sentences. And it is in accord with the syntactic 
measures of Marcus (1971) and Botel and Granowsky (1972). 

The other parts of the Kernel Distance Theory are not confirmed, nor do 
the data reported here confirm Yngve's theory concerning the memory burden 
difficulty supposedly caused by placing a word or phrase at the beginning of a 
sentence. 

However, there was some tendency for all hypotheses to be seen as 
positive. Weber's poor readers all performed in the direction hypothesized and 
DePierro's elementary pupils on the time measure tended to support the 
hypotheses. If any future research is done on these hypotheses, it is 
recommended that it be done with poor readers. 

It is doubtful that the finding of these studies will modify the practical or more 
popular readability formulas, but it might have some bearing on future readability 
research. 

These studies do have one very practical outcome. Writers should avoid 
subject-verb splits if they wish to simplify their writing. 

Fourth graders' comprehension of story structures 
under three recall conditions 

Text, or connected discourse, can be viewed at two levels: (1) the 
microstructure level and (2) the macrostructure level (cf. van Dijk, 1977). At the 
microstructure level, one is concerned with understanding how lexical, 
propositional and sentential variables influence text comprehension. At the 
macrostructure level, one is concerned with understanding how sets of 
propositions and sentences are structured relative to one another and how this 
relative structuring influences text comprehension. 

For the most part reading research has focused on the microstructure level, 
investigating word recognition variables and sentence comprehension variables, 
and has paid little attention to the macrostructure level. Yet, as studies in 
other nonreading disciplines have demonstrated, it has become increasingly 
apparent that models of word recognition or sentence comprehension cannot 
account for many of the factors influencing children's comprehension of written 
text. Although theories of written text comprehension must eventually explain 
how these macrostructure units influence children's reading comprehension of a 
text's macrostructure, the significance of macrostructure variables must be 
clearly understood before this relationship can be explored. 

Researchers investigating how people comprehend story macrostructures 
(e.g., Glenn, 1978; Mandler & Johnson, 1977; Stein, 1978; Stein & Glenn, 1978; 
Stein & Nezworski, 1978) have assumed that there exists an ideal story 
macrostructure which can be described in terms of a specified ordering of story 
categories, logically and temporally related. For example, Mandler and Johnson 
(1977) have defined the ideal story macrostructure as being comprised of 
four categories: (1) Setting, (2) Beginning, (3) Development, and (4) Ending. The 
Setting is a group of propositions or sentences which accomplish the following: 
They introduce the main characters of the story, they describe the time and 
locale of the story, and they provide additional information the reader needs to 
know to understand the events that follow. In the Beginning, something 
causes the main character to respond in some way. (This response often 
results in some type of displacement from the character's normal routine.) 
The Development describes the actions a character uses to obtain some 
explicitly or implicitly defined goal. Finally, the Ending describes the 
consequences of the effect of the goal either having been achieved or not achieved. 

A second assumption of story grammarians is that these macrostructures have 
a real-time psychological counterpart existing in mind; these grammarians have 
called these mind macrostructures "schemata." In sum, story schemata consist 
of sets of expectations about stories, about the categories of which they are 
composed, the way in which these categories are sequenced and the types of 
connections between categories that are likely to occur (cf. Mandler, 1978; 
Stein, 1978; Stein & Glenn, 1978). 

The question of how children employ schemata to comprehend written text is 
unresolved for several reasons. First, the early work of Piaget (1926/1960) 
and Fraisse (1963) suggests that young children lack a framework for organizing 
the events of a story. Both Piaget and Fraisse reported temporal inversions 
in young children's recall and assumed the inversions resulted from an inability 
to establish a framework for organizing chronological, causal, and deductive 
relations. Such findings suggest that children do not have the benefit of well 
defined schemata for organizing events. 

In contrast to these findings, other findings (e.g., Brown, 1975, 1976; Stein, 
1978; Stein & Glenn, 1978) have noted that children do, in fact, organize 
narrative information according to well defined schemata. For example, in one 
study, Brown (1975) tested kindergarten and second-grade children's recognition, 
reconstruction, and recall of simple narrative sequences. For all children, logical 
sequences were better retained than were arbitrary sequences. The difference 
between the two groups was that although kindergarten children could 
recognize and reconstruct the correct sequences, they had difficulty in 
maintaining the correct order during recall. Second-grade children, on the other hand, 
were able to establish the correct order of the events no matter what the 
method of testing was. 

In addition to the problem of such contradictory findings, two other problems 
with studies investigating children's use of schemata in comprehending stories 
stem from the conditions and procedures used to operationally define story 
comprehension. In terms of conditions, most research has focused on children's 
comprehension of orally presented stories. The question of how children 
comprehend written stories remains largely unexplained. Although many researchers 
have adopted the assumption that processes underlying listening comprehension 
are the same as those underlying reading comprehension (e.g., Goodman & 
Goodman, 1977), other researchers have demonstrated that this assumption is 
tenable at neither the microscopic nor the macroscopic level of processing 
(e.g., Hildyard & Olson, 1978; Mosenthal, 1978; Thorndyke, 1977). 

In terms of procedures, the most typical way story comprehension is defined 
is by having subjects recall a story immediately after presentation. Two measures 
of organization in recall are usually derived from this operational procedure. The 
first is the relative salience of categories. As shown by Glenn (1978), Stein 
and Glenn (1978), and Mandler and Johnson (1977), categories vary in terms of 
their salience or structural importance in recall. In particular, Beginning and 
Development categories tend to be recalled more often than Setting and Ending. 
A second measure of organization focuses on the way the categories are 
grouped in recall. According to story grammars, events should be recalled in 
the sequence of Setting, Beginning, Development, and Ending. 

Studies such as Brown's (1975) have demonstrated that under free recall 
conditions, young children tend to use a schema organization for remembering 
and ordering events. On the other hand, under such conditions as recognition and 
forced recall, children tend to episodically remember information without the 
benefit of organizing schemata. In short, whether children would use a schema 
to order episodes from a story in operational procedures other than free 
recall is unclear. 

Another point which needs investigation is whether children's schemata for a 
story change over time. Because most studies of children's comprehension of 
stories focus on immediate recall, it is unclear whether this schema structure 
prevails over time. Research is divided on this point. On the one hand, there 
are those researchers (e.g., Mandler, 1978; Spiro, 1977) who argue that over 
time, children will reconstruct narrative sequences so that they are assimilated 
to new text structures, based upon redefined schemata. On the other hand, 
there are those researchers (Cofer, 1943; Rubin, 1977; Squire, Chase, & Slater, 
1972) who argue that the structure of narrative sequences is left intact due to 
superficial, or reproductive, processing. In sum, how pervasive children's 
story schemata are in organizing related narratives into a larger schema structure 
has been poorly addressed. 

The purpose of this study was to investigate the function which schemata 
play in fourth grade children's reading comprehension of stories under different 
comprehension conditions. The three different comprehension conditions 
included: immediate recall, delayed recall, and immediate forced recall. If 
children do, in fact, use schemata to comprehend stories, as story grammars 
predict, then one would expect children, in an immediate recall condition, to 
consistently recall more information from some story categories than from others. 
Secondly, if schemata are the principal means by which children organize and 
interpret narrative events, one would predict that the most saliently remembered 
categories in free recall would be the most saliently remembered categories in 
the forced recall. Finally, one would predict that if children's schemata serve 
to direct reconstructive, long-term remembering, then four complete stories 
which, when put together, form a complete new story will be remembered in the 
order of saliency of the categories they represent. In other words, if four 
stories, when put together, comprise four categories of a well formed new story 
(these categories being, namely, Setting, Beginning, Development and Ending), 
then one would predict that in a delayed recall of these stories, the stories 
most saliently recalled would be those which comprise categories most 
saliently recalled in the individual story recall conditions. These hypotheses 
were tested in the following experiment. 

Method 



Subjects 

Thirty students were randomly selected from two fourth-grade classrooms in a 
large suburban school near Albany, New York. All students participating in 
the experiment could read and respond to the stimuli in the manner required. 

Materials 

A story from a fourth-grade social studies text was rewritten into four 
smaller stories following four different episodic parts of the original story. Each 
of the four stories was written to conform to the story-grammar tree structures 
of Mandler and Johnson (1977). Each story structure consisted of four categories: 
the Setting, the Beginning, the Development, and the Ending. There were three 
events in the Setting, three in the Beginning, four in the Development, and one in 
the Ending. Each event within and across all four stories consisted of eleven 
propositions, as defined by Kintsch (1974). The words in the propositions 
within and across all four stories were controlled for in terms of frequency 
(Kucera & Francis, 1967) and imagery value (Paivio, Yuille, & Madigan, 1968). 

Procedure 

The thirty students read each of the four stories. After reading a story, each 
student was first required to write all that he could remember about the 
story. After the written recall, each student attempted to complete a cued recall 
test. This test required that the student fill in deleted information in eleven 
statements. Each of the statements corresponded to information in each of the eleven 
events in a story respectively. All students were told that they should read 
the story carefully since they would be asked to write about each story and 
answer several questions. In addition, all students were informed that they would 
be asked to write about the story one week later. The students read and 
completed the tasks for each of the four stories on consecutive days of one 
week. Eight days later, all students were asked to recall as much as they could 
about all four stories. The students were tested in groups, with each group 
receiving similar instructions. Instructions for the free recall were, "Write 
everything you can remember about the story you have just read. Try to remember 
everything, including the words the author used." For the delayed recall, 
subjects were instructed to "Write everything you can remember about the four 
stories you read last week. Try to remember everything, including the words that 
were in the story." 

Scoring 

Responses to the free and cued recall tasks were marked by two raters.
Acceptable responses included propositions which were literal reproductions
from the text stimuli or else were synonymous with predefined text propositions.
In the immediate free recall, four scores were tallied for each story. These
scores included the number of propositions recalled in the Setting, the Beginning,
the Development, and the Ending. Similarly, in the immediate cued recall, the
number of propositions correctly recalled from the Setting, Beginning,
Development, and Ending were likewise tallied. In the delayed recall, the number of
propositions correctly recalled from each of the four categories from each of the
four stories was also tallied.

Results 

In the following analyses, in order to compensate for the fact that some story
categories had more events and propositions associated with them, recall and
recognition were calculated in terms of the mean proportion of propositions
recalled and recognized per category. The mean proportion of propositions
recalled per category was calculated by multiplying the number of events per
category times the number of propositions per event and dividing this product
into the number of propositions recalled per category. In determining the mean
proportion of propositions recognized per category, the number of cued recall
questions correctly answered was divided by the number of events per category.
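
The two proportions described above can be sketched as follows. This is a minimal illustration, assuming the event counts given in the Materials section (three, three, four, and one events per category, eleven propositions per event); the function names and the example scores are hypothetical and do not come from the study.

```python
# Events per story category and propositions per event, as stated in Materials.
EVENTS_PER_CATEGORY = {"Setting": 3, "Beginning": 3, "Development": 4, "Ending": 1}
PROPOSITIONS_PER_EVENT = 11


def free_recall_proportion(category: str, propositions_recalled: int) -> float:
    """Propositions recalled divided by (events x propositions per event)."""
    possible = EVENTS_PER_CATEGORY[category] * PROPOSITIONS_PER_EVENT
    return propositions_recalled / possible


def cued_recall_proportion(category: str, questions_correct: int) -> float:
    """Cued recall questions answered correctly divided by events in the category."""
    return questions_correct / EVENTS_PER_CATEGORY[category]


# Hypothetical child: 24 Setting propositions recalled, 2 of 3 Setting questions correct.
print(free_recall_proportion("Setting", 24))
print(cued_recall_proportion("Setting", 2))
```

Dividing by the number of possible propositions (or questions) puts a one-event category such as the Ending on the same 0 to 1 scale as the four-event Development.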

The first hypothesis was that if children use schemata to comprehend stories,
then children should consistently recall more propositions from some story
categories than from others. An Analysis of Variance of the mean proportion
of propositions in basic story categories recalled immediately after reading the
text (see Table 1) supported this hypothesis, F(3, 116) = 19.11, p < .01. No
interactions proved significant. This hypothesis was further supported by a series
of Scheffé pairwise comparisons between the Setting and the other categories
within the stories; all Scheffés were significant except for the pairwise
comparison between the Setting and the Beginning in Story 2. Propositions from
the Setting and Beginning were recalled significantly more frequently than from
the Development; propositions from the Development were recalled significantly
more frequently than from the Ending across all four stories, with the exception
noted in Story 2.

The second hypothesis posited that if schemata are the principal means by
which children organize and interpret narrative events, then the most saliently
remembered categories in free recall should be the most saliently remembered
categories in the forced recall. This hypothesis was supported by the fact that
an overall Analysis of Variance of the mean proportion of cued recall statements
correctly answered was significant, F(3, 116) = 17.7, p < .01 (see Table 1). No
interactions proved significant. A series of Scheffés proved significant for all
pairwise comparisons, except between the Setting and the Beginning in Story 2
and between the Beginning and the Development in Stories 3 and 4. The fact



Table 1

Saliency of Category and Story Recall over
Four Stories Recalled under Three Different Conditions

Saliency (measured as mean proportion of propositions correctly recalled) of
Story Categories in Immediate Recall

            Setting   Beginning   Development   Ending
Story 1       .72        .51          .43         .22
Story 2       .67        .58          .48         .17
Story 3       .61        .49          .39         .16
Story 4       .69        .53          .42         .20

Saliency (measured as mean proportion of recall questions correctly answered) of
Story Categories in Forced Recall

            Setting   Beginning   Development   Ending
Story 1       .69        .58          .24         .20
Story 2       .58        .53          .39         .12
Story 3       .59        .42          .37         .19
Story 4       .63        .47          .39         .23

Saliency (measured as mean proportion of propositions correctly recalled) of
Story Categories in Delayed Recall

            Setting   Beginning   Development   Ending
Story 1       .42        .21          .29         .34
Story 2       .31        .10          .18         .21
Story 3       .17        .07          .12         .18
Story 4       .05        .02          .10         .12

that the saliency of the categories in the immediate and delayed recall
conditions was similar supports the contention that children by fourth grade use the
same schemata for comprehending stories under varying operational conditions.

The third hypothesis was that if children's schemata serve to direct
reconstructive long-term remembering, then children should recall four stories, serving
as a Setting, Beginning, Development, and Ending, respectively, to a
superordinate story, in the same manner as they recalled the Setting, Beginning,
Development, and Ending in the immediate recall stories. An Analysis of Variance
of the number of propositions recalled from each story revealed an overall
significance between stories, F(3, 116) = 21.82, p < .01 (see Table 1). A
series of Scheffé pairwise comparisons revealed that Story 1 (i.e., the Setting)
was recalled significantly better than Story 2 (i.e., the Beginning); Story 2 was
recalled significantly better than Story 3 (i.e., the Development); and Story 3 was
recalled significantly better than Story 4 (i.e., the Ending). These results were
similar to those of the immediate recall findings that demonstrated a saliency
order, from most to least, of Setting and Beginning, Development, and Ending.
A series of Scheffés revealed a significantly different saliency preference for
recalling categories within each story in the delayed recall condition. The order
of saliency preference for each of the stories was Setting, Ending, Development,
and Beginning.

The uniqueness of this final finding is further compounded when one compares
the recall order of categories within the stories in the immediate and delayed
recall conditions and between stories in the delayed recall condition. In the
immediate recall condition, the Setting was recalled first 96% of the time,
the Beginning was recalled second 97% of the time, the Development was
recalled third 94% of the time, and the Ending was recalled fourth 99% of the
time. In the delayed recall, between stories, Story 1, as Setting, was recalled
first 97% of the time; Story 2, as the Beginning, was recalled second 98% of
the time; Story 3, as the Development, was recalled third 94% of the time; and
Story 4, as the Ending, was recalled fourth 99% of the time. The recall order
of these categories was in keeping with the order story grammars predicted.
However, in the delayed recall within stories, the Setting was recalled first 94% of
the time, the Ending was recalled second 86% of the time, the Development
was recalled third 95% of the time, and the Beginning was recalled fourth 92%
of the time. (A series of ANOVAs performed on the recall percentage of
categories under each condition all proved significant, p < .01.) This order of
category recall was not in keeping with the order story grammars predicted.

Discussion 

The findings of this study support the notion that children employ schemata in
comprehending written stories much the same as they do in comprehending
spoken stories in immediate recall. The nature of these schemata is such that
text is processed hierarchically, with the most significant story categories being 
the best recalled (cf. McKoon, 1977; Meyer & McConkie, 1973). Furthermore, 
it appears that schemata serve to temporally order categories recalled, with 
the most salient categories being recalled first. 

These findings additionally demonstrate that fourth-grade children employ 
schemata in the same manner under both free and forced immediate recall 
conditions. 

While these conclusions are supported by the findings in the immediate recall
condition, they are not wholly supported by the findings in the delayed
condition. In the delayed condition, individual stories serving as episodes of a
larger story were recalled in the manner story grammars predicted; saliency and
order of recall followed the predicted pattern. On the other hand, saliency and
order of recall of categories of the individual stories did not conform to
the predicted pattern. The order of saliency and recall was Setting, Ending,
Development, and Beginning.

This deviation in the delayed condition is unique for several reasons. First,
although the curve representing the saliency of recall resembles the typical
serial learning curve, with recency and primacy effects, the recall order of the
Setting and Ending does not conform to the typical recall order in serial position
studies. In addition to the recall order finding, other studies (e.g., Johnson &
Scheidt, 1977; Meyer & McConkie, 1973) have demonstrated that primacy
effects in prose recall are not a function of seriation. What appears to have taken
place is that, over time, fourth graders assimilate smaller story schemata into
larger schemata and, in the process, reorganize the original schematic
organization of individual story episodes. However, why this study's children
arrived at the particular category organization they did in the delayed recall
needs further investigation and explanation.

These findings raise an interesting question for the teaching and evaluation
of reading. Most teaching and evaluation is based upon teaching and evaluating
children's immediate comprehension of text. Comprehension, however, is not
only a process of a moment but of a lifetime. The question of how one
teaches children to comprehend text episodes so that they are effectively and
"ideally" integrated over time needs careful consideration. How one is to
evaluate the effectiveness of this integration also needs to be considered.
Differential effects of prior context, style and
deletion pattern on cloze comprehension

Psycholinguists and information processing theorists (Goodman, 1976;
Rumelhart, 1976; Smith, 1975) suggest that the proficient reader who
possesses an adequate knowledge base relies on context and syntactic and
semantic cues in order to comprehend reading material. Context allows the
reader to access relevant knowledge which, in turn, facilitates the
comprehension process (Schallert, 1975). It has been demonstrated that
interpretation of entire passages can be affected by the context (e.g., the
passage title; Bransford & Johnson, 1972; Schallert, 1975).

Goodman (1976) proposes that, as readers progress through a passage,
they must utilize their own semantic base to recreate the author's message.
Implicit in this model of the reading process is the idea of conceptual build-up.
As readers continue reading, the availability of additional information, cued
from the text, further constrains the possible interpretations of subsequent
text. In analyzing the miscues of fourth-grade children reading narrative
material, Menosky (1971) found that the quality of the students' miscues
changed as they read the first, second, and final thirds of the story. Miscues
progressively detracted less from the author's message. Menosky interpreted
her results as evidence of contextual build-up in support of the Goodman
model.

If the cloze procedure (Taylor, 1953, 1956) as a measure of comprehension
has validity, according to the psycholinguists' models, evidence of contextual
build-up should be expected. Some studies suggest that the ability to complete
cloze is bilaterally constrained by four to five words on either side of the
deletion (Aborn, Rubenstein, & Sterling, 1959; MacGinitie, 1961; Miller &
Coleman, 1967). Other studies provide evidence that this ability may be
influenced by context that extends beyond the immediate sentence, perhaps
even to an entire paragraph (Ramanauskas, 1972). No consensus has been
reached concerning the effect of context on cloze, nor has this influence been
studied over passages longer than a paragraph.

The purpose of this study was to examine the effect of varying amounts of
prior context on cloze performance. Consistent with the Goodman model, it
was predicted that, if contextual build-up occurred, cloze comprehension would
be greatest for those who received the most context prior to the cloze passage
(500 words), less for those who received some context (250 words), and least
for the control group.

Two other factors, style of writing (narrative or expository) and deletion
pattern (one-fifth or function word), were considered. Sager (1977) suggests
that in technical material the logical argument is carried by function words.
Due to the descriptive nature of narrative material, the content usually is not
organized to present a logical argument and function words serve a more 
general purpose. Therefore, it was hypothesized that function words would be 
more difficult to replace in expository material than in narrative material. One- 
fifth deletion was included so that the results of this study could be compared 
with the existing research on cloze. 
Method



Sample 



The 93 subjects for this study consisted of 53 eleventh-grade students and
40 twelfth-grade students in a suburban New Jersey high school. They were
enrolled in either academic composition or general English classes. The
sample mean on a standardized reading comprehension test ranged from the
38th to 40th percentile.

Materials

Two narrative cloze passages (Kon-Tiki and 1984) and two expository cloze
passages (How to Read Body Language and The Good-Natured Gorilla) were
created. On three readability measures, the narrative material ranged from
seventh to ninth grade level and the expository from ninth to twelfth grade. A
one-fifth deletion pattern was applied to Kon-Tiki and Body Language. For
1984 and Gorilla, if the fifth word (or multiple of 5) was a function word, it was
deleted. If not, the next function word following the fifth word was deleted. All
cloze passages were approximately 250 words long, allowing for 50 deletions.
All deleted words were replaced by unnumbered 10-space blanks.
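
The two deletion patterns described above can be sketched in code. This is only an illustration under stated assumptions: the function-word list is a small hypothetical sample (the list actually used is not given), words are assumed to be pre-tokenized, and counting is assumed to restart at each multiple of five regardless of where the previous deletion fell.

```python
BLANK = "_" * 10  # an unnumbered 10-space blank

# Illustrative function-word list (an assumption; the study's list is not given).
FUNCTION_WORDS = {"a", "an", "the", "and", "but", "or", "if", "of", "to", "in",
                  "on", "at", "by", "for", "with", "that", "which", "it", "he",
                  "she", "they", "is", "was", "were"}


def one_fifth_positions(words):
    """Indices of every fifth word (the 5th, 10th, 15th, ...)."""
    return list(range(4, len(words), 5))


def function_word_positions(words, function_words=FUNCTION_WORDS):
    """For each fifth word, delete it if it is a function word; otherwise
    delete the next function word that follows it."""
    positions = []
    last = -1
    for i in range(4, len(words), 5):
        j = max(i, last + 1)  # never delete the same word twice
        while j < len(words) and words[j].lower() not in function_words:
            j += 1
        if j < len(words):
            positions.append(j)
            last = j
    return positions


def make_cloze(words, positions):
    """Replace the words at the given indices with blanks."""
    deleted = set(positions)
    return " ".join(BLANK if i in deleted else w for i, w in enumerate(words))


words = "the cat sat on a mat and the dog ran to the barn".split()
print(make_cloze(words, one_fifth_positions(words)))
print(make_cloze(words, function_word_positions(words)))
```

Under the one-fifth pattern a 250-word passage yields exactly 50 blanks; under the function-word pattern the count can fall slightly short if no function word follows a late fifth word.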

Each of these four passages appeared in three prior context conditions: (a)
only the cloze passage, (b) the cloze passage preceded by a page of 250
words of the text, and (c) the cloze passage preceded by two pages of the text,
250 words per page.



Procedure

This study was conducted in six 45-minute sessions over a period of three to
four weeks. During the first session two teaching passages were administered
and four points were stressed: 1) Only one word belongs in the blank. 2)
Spelling does not count. 3) If more than one word seems to fit the blank space,
choose the one you think is most consistent with the author's message and
style. 4) Remember that cloze is a difficult task. These four points were also
reviewed at the beginning of each cloze testing session. In the second session
the subjects were administered the Reading Comprehension section of the
Iowa Silent Reading Test, Level 3, Form F (1972). In each of the last four
sessions, subjects completed the experimental cloze passages.

Design

A split-plot design was used with amount of prior context the between-group 
variable. Style and deletion pattern were repeated within-group variables 
counterbalanced for passage order, grade and class.

Passages were scored for percent of exact replacement of deleted items. 
All two-word responses, morphological variations, and omissions were scored 
incorrect.
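
The exact-replacement scoring rule can be sketched as below. This is a hedged illustration, not the authors' procedure: the function name is hypothetical, and the comparison is a plain case-insensitive string match, so the leniency toward misspellings that the instructions to subjects promised ("Spelling does not count") would still need a human rater.

```python
def score_exact_replacement(responses, deleted_words):
    """Return the percent of blanks filled with exactly the deleted word."""
    if not deleted_words or len(responses) != len(deleted_words):
        raise ValueError("one response is required per deleted word")
    correct = 0
    for response, target in zip(responses, deleted_words):
        tokens = response.split()
        # Omissions (no tokens) and two-word responses are scored incorrect.
        if len(tokens) != 1:
            continue
        # Morphological variants ("running" for "run") fail the exact comparison.
        if tokens[0].lower() == target.lower():
            correct += 1
    return 100.0 * correct / len(deleted_words)


# Hypothetical answers to five blanks: two exact matches, one morphological
# variant, one omission, and one two-word response.
print(score_exact_replacement(["the", "running", "", "big dog", "Cat"],
                              ["the", "run", "a", "dog", "cat"]))
```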



Preliminary investigations revealed no differences attributable to sex or
grade. Therefore, these variables were not considered further.

In a 3 x 2 x 2 (Context x Style x Deletion) factorial analysis of variance, no
significant main effects of prior context were found, F(2, 90) = 0.365, p > .05.
This result seems to lend support to the position that the cloze task forces the
subject to concentrate on a more limited area of the text (Aborn et al., 1959;
MacGinitie, 1961; Miller & Coleman, 1967). Another interpretation of these
data from the perspective of discourse analysis is that the demands of the
cloze task may be interfering with the subject's ability to utilize the strong
cuing systems of connected discourse, forcing the subject to attend to
syntactic and semantic information at the sentence level. If this interpretation
is accurate, it appears that cloze is not an adequate measure of the reading
process.¹

Although there was no overall effect, upon inspection of individual items, it
appeared that certain types of items such as pronouns and adjectives of
quantity were more affected by prior context. One way to identify these items
would be to analyze cloze answers using the lexical marker-transfer feature
model proposed by Finn (1977-78). Once words of this type are identified,
meaning those which are affected by prior context, experimental passages
could then be constructed using these words as deleted items, thus modifying
the cloze deletion pattern as it has been traditionally defined.

Style, F(1, 90) = 60.56, p < .001, and type of deletion, F(1, 90) = 1873.67, p
< .001, were both significant. However, even though the difference in style
was significant, it only accounted for 3% of the total sums of squares. The
score for narrative style (M = 54.75) was significantly greater than that for
expository style (M = 48.43). Although readability was equivalent within style,
it was not equivalent across styles. Therefore, this difference in means may be
a function of the different readability ranges of the styles.

The significant difference for type of deletion accounted for almost 53% of
the total sums of squares. Table 1 presents the means. In both styles, one-fifth
cloze was more difficult than function word cloze.

Table 1
Mean Percent of Correct Responses
on Narrative and Expository Passages
with Random One-Fifth or Function Word Deletions

            CTX 1            CTX 2            CTX 3           TOTAL M
         1/5     Fct      1/5     Fct      1/5     Fct      1/5     Fct
Nar     39.48   66.77    42.26   68.71    41.42   69.87    41.05   68.45
Exp     35.94   59.94    36.77   63.68    33.68   60.58    35.46   61.40
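
As a quick arithmetic check, the style means reported in the text (54.75 and 48.43) can be recovered by averaging the two Total M cells for each style in Table 1. The sketch below assumes that each style mean is the simple average of its one-fifth and function word means; the variable and function names are illustrative only.

```python
# Total M column of Table 1: (one-fifth, function word) mean percent correct.
TOTAL_M = {"narrative": (41.05, 68.45), "expository": (35.46, 61.40)}


def style_mean(style):
    """Average the one-fifth and function word means for one style."""
    one_fifth, function_word = TOTAL_M[style]
    return (one_fifth + function_word) / 2


for style in TOTAL_M:
    print(style, round(style_mean(style), 2))
```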



It was hypothesized that there would be a significant interaction of Style x 
Deletion, that subjects would score higher on function word deletions in the 
expository style passage than in the narrative style passage. Results show no 
significant interaction, F (1,90) = 1.026, p > .05. In this study the function 
word deletion pattern was constructed to be as consistent as possible with the 
cloze literature. Inspection of the deleted items revealed a low proportion of 
words such as conjunctions which Sager (1977) notes primarily carry the 

1 The authors wish to thank reviewer 8.1 for helpful comments on this point. 
logical argument of the text. Further research employing a deletion pattern
based only on conjunctions might produce evidence of the specific role of
these function words in expository material.

A major characteristic of narrative style is elaboration: the
redundancy, the variety of vocabulary items, the use of synonyms, the reliance
on a number of terms to describe a particular concept. For example, in
Kon-Tiki, wave is referred to as (a) "the wall," (b) "the great green wall," (c) "the
mountain," (d) "the hell," and (e) "the sea." A characteristic of expository
material, on the other hand, is precision. Lexical items, particularly nouns, are
not varied in an attempt to maintain clarity.

A further analysis (based on an analysis by Fillenbaum, Jones, & Rapoport,
1963) of the one-fifth cloze passages investigated this style difference. A
lexical score, representing the percent of lexical items correct, and a function
score, representing the percent of function words correct, were calculated
(see Table 2). A 3 x 2 analysis of variance (Context x Style) was performed
using each of these scores, lexical and function percentages. Results indicate
that lexical items in narrative material were significantly more difficult to
replace than in expository material, F(1, 90) = 5.40, p < .05. This further
analysis also indicated that function words in the expository material were
significantly more difficult to replace than those in narrative material, F(1, 90)
= 6.56, p < .05. Even though there is a confounding of readability and style,
these results tentatively support the conceptualization of this difference
between expository and narrative styles. This conceptualization needs further
testing utilizing materials of equivalent readability. Additionally, it is felt that
this type of analysis may prove useful in future cloze research.

Table 2

Mean Percent of Correct Responses
on One-Fifth Cloze Passages
in Three Item Categories

                        CTX 1    CTX 2    CTX 3    TOTAL M
Kon-Tiki (Nar)
  Total                 39.48    42.26    41.42     41.05
  Lexical               17.26    20.00    20.48     19.25
  Function              54.30    57.10    55.38     55.59

Body Language (Exp)
  Total                 36.00    36.77    33.38     35.48
  Lexical               22.70    23.30    18.88     21.63
  Function              51.61    52.60    51.05     51.75
Traditionally it has been demonstrated that cloze correlates highly with
standardized reading comprehension tests. When one wishes to examine
specific aspects of the comprehension process, cloze seems less appropriate
as a measure, since it seems to force processing to occur at the sentence
level. The fact that the deletion patterns have been based on statistical, not
linguistic, models may be another reason why cloze has been insensitive to the
natural cuing systems in connected discourse.

Bowers and Nacke (1971-72) and Ohnmacht, Weaver and Kohler (1970)
have proposed that deletion patterns be based on a linguistic model. One
promising approach to determine those linguistic items which provide
cohesion in discourse is that of Finn (1977-78). Once such items have been
identified, deletion patterns based on these items can be constructed as more
accurate measures of contextual build-up. Additional factors which must be
considered in developing new deletion patterns are those associated with the
differences in style and organization of textual material.
Some things the reader needs to know
that the listener doesn't



Much attention has been paid to the similarities between listening and
reading. In both teaching programs and research, it has often been assumed
that reading comprehension ability consists of a simple combination of
listening comprehension and print-to-sound decoding abilities. Clearly there
are many important similarities between listening and reading, and through
experiences with spoken language the child acquires much of what he needs
to know in order to read. However, this emphasis on the similarities between
listening and reading has led to the neglect of some important differences.
This paper will focus on these differences and consider the question: What do
skilled readers need to know that they would not have acquired via experience
with spoken language? Particular emphasis will be placed on the listening and
reading tasks encountered by transitional readers, those who have mastered
individual word decoding but still have a lot to learn before they can fluently
comprehend written language.

The differences between the written and spoken language school children 
encounter will be divided into five categories: (1) differences due to the 
existence of intonation and stress in speech but not in writing; (2) differences 
in the situations in which speech and writing are generally used; (3) 
differences in the functions speech and writing most commonly serve; (4) 
differences in characteristics such as vocabulary, syntactic complexity, and 
amount of redundancy; and (5) differences due to the permanent nature of
writing and the transient nature of speech. In the final section, we will consider
the ways in which these differences interact to make the task faced by novice
readers different from the listening tasks with which they are already familiar.

Intonation and Stress 

It is obvious that speech contains intonation and stress while writing does 
not. However, it may not be immediately obvious that intonation and stress 
provide information that is useful to listeners. Intonation helps listeners divide
the stream of speech into meaningful word groups. Stress is useful in
determining which is the new or important information in a sentence.

First, consider the use of intonation to help determine which words go 
together to form a meaningful group. Such groups of words are sometimes
called constituents. There is a large variety of evidence that such grouping is 
necessary to understand any but the simplest sentences, whether they are 
presented in writing or in speech (e.g., Clark & Clark, 1977). Consider the 
following sentence (Graf & Torrey, 1966) divided in two different ways: 



A

During World War II
even fantastic schemes
received consideration
if they gave promise
of shortening the conflict.

B

During World War
II even fantastic
schemes received
consideration if they gave
promise of shortening the
conflict.



A is easier to read than B because it is divided at constituent boundaries. That
is, it has been prechunked into meaningful units. Now consider reading this
sentence aloud. Clearly the pauses tend to fall along the breaks given in A, not
those given in B. This is a well documented finding: Pauses and other
intonation features often provide cues to constituent boundaries.

The reader also must determine which words go together to form
meaningful groups. How can this be done without intonation? Punctuation
provides some help, but is generally not sufficient; most constituent
boundaries are not marked by any punctuation. The reader must depend on
syntactic and semantic cues to constituent boundaries. For example, some
types of words, such as determiners (a, the), quantifiers (some, all, many), and
definite pronouns (I, you, she) usually occur at the beginning of a constituent.
The use of syntactic cues such as these seems to require more complex
knowledge and processing than the use of intonation cues; they simply are not
as obvious as intonation. This may cause difficulty for novice readers in
determining the constituents of written sentences, and therefore in
comprehending them.

Stress provides cues useful in separating the new or focal information 
contained in a spoken sentence from the less important information (Bolinger, 
1972). For example, consider the following sentences spoken with the 
capitalized word stressed: 

C. JOHN stole the picture.

D. John STOLE the picture.

E. John stole the PICTURE.

In each case the stressed word would be the one carrying the new
information. That is, sentences C, D, and E could be answers to questions C',
D', and E', respectively.

C'. Who stole the picture?

D'. What did John do with the picture?

E'. What did John steal?

The stress on the new or important terms provides the listeners with useful
information. Readers must compensate for the lack of stress cues and are
forced to rely upon less obvious cues to identify the important information.
Readers must make greater use of syntactic cues (compare It was John who
stole the picture with It was the picture that John stole). Also, readers must
make greater use of information from previous parts of the text and from their
own knowledge to determine which is the important or new information. Again,
the lack of cues which are available in speech forces readers to use more
complex knowledge and processes than listeners, and this may present
problems for novice readers.



Differences in the Situations in which Speech and Writing are Used
Anything written can be read aloud and anything spoken can be written
down. However, the two modes are by no means interchangeable. Some
situations and purposes call for spoken communication and others for written.
For example, speech is most commonly found in situations where the
communicants are in the same place. Therefore, speakers and listeners often
share a mutual nonlinguistic context and are able to interact with each other.
Writers and readers generally do not share a common nonlinguistic context
and generally cannot interact with each other. A shared context would often
facilitate understanding, and we know that young children rely a great deal on
context to help them understand speech. Also, when the communicants can
interact, the speaker can take into account what the listener already knows
and doesn't know, and the listener can direct the speaker by asking questions,
making comments, or just looking puzzled. These differences are considered
further in the paper by Rubin in this volume.

Differences in the Functions of Written and Spoken Communication

Speech and writing also differ in their usual functions. There is a strong
tendency for speech to be used for informal social communications and
writing for the more formal communication of information. This difference may
result in difficulties for novice readers in two ways. First, the reading tasks
they face may often assume knowledge that would not be necessary to
understand the spoken language they usually encounter. That is, the
acquisition of much new knowledge and the extension of existing knowledge is
necessary for successful reading. Secondly, interpersonal communication
may be much more motivational than informational communication. Many
children may lack motivation to work at understanding the abstract, formal,
detailed language often found in writing.

Differences in the Language Used in Speech and Writing

Studies have found that the actual language used in writing tends to differ
from language used in speech in a variety of characteristics. DeVito (1965)
compared samples of the writing and speaking of ten speech professors on
topics of professional interest. He found the writing contained longer and less
common words, as well as a larger diversity of words. Similar studies have
found that writing tends to be less redundant than speech: Speakers often
repeat themselves, either verbatim or in paraphrase (Walker, 1975; Horowitz &
Newman, 1964; Wilkinson, 1971). Furthermore, it has been proposed that
writing tends to be syntactically more complex (as indicated, for example, by
frequency of subordinated and conjoined clauses) and more detailed and
precise than speech (Horowitz & Berkowitz, 1967; Wilkinson, 1971). It has also
been suggested that certain types of complex discourse structures may be
more natural in writing (Danks, 1974).

If these differences hold for the speech and writing children encounter, they
would entail differences in the knowledge necessary for successful reading
and listening. The novice reader may well face more complex vocabulary,
sentence syntax, and discourse structures than he had previously
encountered in speech, and therefore would need to extend his knowledge in
these areas. He also must adjust to the greater detail and precision found in
writing, and learn to take advantage of the permanence of writing to
compensate for its lack of repetition.

Using the Permanence of Written Text

The permanence of writing provides readers with some very useful options
not available to listeners. Readers can proceed at their own rate, while
listeners must follow the material as the speaker presents it (although this may
often be compensated for by the option of interacting with the speaker).
Furthermore, the pace is not simply set and then maintained throughout the
text. Skilled readers slow down for important or confusing passages and speed
up for easy or unimportant ones. Another option available to readers is
returning to previously read parts of the text. Skilled readers do this often,
going back to reread as little as a single word or phrase or as much as a large
section of text. Efficient readers may also take advantage of the permanence
of writing by previewing the text to organize further reading (Robinson, 1970).
In general, skilled readers take advantage of the permanence of written text by
efficiently extracting the information they need as they need it, but poor
readers may not do so (Neville & Pugh, 1976-1977).

Efficient extraction of information from text requires some skills which have
been generally neglected in studies of reading. Readers need to monitor their
own comprehension so they can determine when rereading is necessary, and
they need to evaluate what they are reading to determine if it is important and
needs to be read slowly and carefully. Very little is known about how skilled
readers do this monitoring and evaluation or about how these skills develop,
but the limited available evidence suggests that monitoring and evaluating
may be surprisingly difficult for young children (Markman, 1977; Brown &
Smiley, 1977). The importance of these skills in reading, their development,
and how they can be trained are clearly in need of further study.

Interactions of the Differences Between Written and Spoken Language 

Differences from these five categories interact in determining how the skills 
and knowledge necessary for successful reading differ from those necessary 
for successful listening. By way of summary, some of the areas in which the 
novice reader may need to acquire new skills and knowledge will be reviewed. 

Successful reading may require more comprehensive general knowledge of
the world than listening for a number of reasons. Writers generally cannot
tailor their message to fit a particular reader, while speakers often can. Also,
writers are unable to receive continuous feedback from the recipients of their
message and are not available to answer requests for clarification, as
speakers are in many situations. As Socrates tells us in the Platonic dialogue
Phaedrus, "Written words seem to talk to you as though they were intelligent,
but if you ask them anything about what they say . . . they go on telling you the
same thing forever."

Since readers are unable to influence how the message is communicated,
they must depend upon their own abilities and knowledge to interpret it. The
use of writing for more informational, rather than interpersonal,
communication, and the greater detail and precision found in writing, also
contribute to the novice readers' need to increase and expand their
knowledge.

Novice readers also need to increase their knowledge of syntax and
vocabulary over that acquired via listening. The syntax encountered in writing
may often be more complex, and the vocabulary more diverse, than that found
in speech. Also, since intonation and stress are not available, readers must
depend more upon syntactic and semantic cues to determine constituent
boundaries and which terms convey the focal information. Furthermore,
readers must comprehend the syntax and vocabulary as it is written; they
cannot interact with the writer to ask for clarification and they are less likely
than listeners to have the same information restated in a different form.

Although writing presents some unique difficulties, it also provides the reader
with some options that, when used properly, can facilitate comprehension and
perhaps compensate for some of the ways it is more difficult than listening.
Since writing is permanent, readers can set their own pace, reread when
necessary, and preview the material to organize further reading. Efficient use
of these sampling options requires that readers monitor their own
comprehension, so they know when to reread or slow down, and evaluate what
they are reading, so they can attend carefully to the material that is important
for their purposes. These two skills of monitoring and evaluating may be very
difficult for novice readers.

We have described several types of knowledge that novice readers may 
need to acquire or increase, and several types of cognitive processing they 
need to master. It is important to realize that readers cannot simply deal with 
one of these requirements at a time, but must use all these types of knowledge 
and processes at once. Even if a reader is capable of monitoring his or her 
own comprehension, evaluating the material, using syntactic and semantic 
cues to determine constituent boundaries and the important terms,
understanding the vocabulary and syntax, and using the required general 
knowledge, doing all of these at once may overcome his or her attention and 
processing capacities. That is, even with each individual component 
mastered, combining them into efficient reading may present difficulties.

A taxonomy of language experiences



Children come to the task of reading with a set of well-practiced oral
language comprehension skills. This linguistic skill, which a child has
developed in a remarkably short time, obviously greatly facilitates learning to
read. In fact, Huey (1908) regards reading as a process of decoding
orthographic symbols to a phonemic representation, then comprehending that
as if it were speech: "The child comes to his first reader with his habits of
spoken language fairly well formed and these habits grow more deeply set
with every year. His meanings inhere in this spoken language and belong but
secondarily to the printed symbols." Yet the current furor over children's
inability to comprehend what they read indicates that a child's oral language
skills in and of themselves must not be sufficient precursors to reading.
Although it is clear that the necessity for visual decoding is a difference 
between children's oral language and reading comprehension, it is but one of 
a great many distinctions, all of which may well present stumbling-blocks for 
children learning to read. This paper introduces a taxonomy of the differences 
between children's typical oral language experiences and the experience of 
reading a book. By delineating the dimensions along which these language 
experiences differ, we hope to understand better the nature of the cognitive 
leap we expect children to make in learning to read. In addition, the set of 
differences between oral conversations and written text gives us the tools to 
specify the relationships among other language experiences such as watching 
television or plays, talking on the telephone and reading comics and to 
understand the cognitive demands each of these experiences makes on the 
child as comprehender.

The majority of a child's oral language experiences may be described as 
interactive conversations in which the child participates as both speaker and 
listener. All the participants share a spatial, temporal and situational context 
and their verbal communication is augmented by intonation, facial expression
and gestures. The differences between this situation and that of a child
reading a story may be divided into two large subcategories: those having to
do with the communicative medium and those dealing with the message; each
of these subcategories is further divided into dimensions. The emphasis here
will be on medium dimensions and on the new demands these differences
impose on children learning to read; the message dimensions — topic,
structure and function — will be discussed only briefly, but a more complete
description may be found in Rubin (1978). 

There are seven dimensions along which the communicative medium of a
language experience can be placed. If we think of a space defined in terms of
these seven dimensions, a child's oral language experience, as described
above, would lie on the opposite end of a long diagonal from reading stories,
with one point being (0,0,0,0,0,0,0), the other (1,1,1,1,1,1,1). The
medium-related dimensions are: modality, interaction, involvement, spatial
commonality, temporal commonality, concreteness of referents, and
separability of characters. While these distinctions are in some measure
intuitive, there is as yet little or no evidence that any one of them is a crucial
difference which causes problems for children learning to read. The requisite
experiments have not been done at least in part because these dimensions
have not been succinctly identified before. The framework presented here
should be regarded, then, as a set of hypotheses and suggestions for future
research. Further descriptions of the dimensions are as follows:

1. MODALITY — is the message written or spoken? This dimension is the
one on which most research on the relationship between listening and reading
has focused (see Danks, 1977, for a review of this work). In fact, that work has
mainly concentrated on only one aspect of this distinction: the added
necessity of visual decoding in reading. Even in this single dimension,
however, there are other differences which impinge substantially on the
processing demands of the comprehension task.

Spoken language has as one of its most salient aspects the use of stress,
intonation, and other prosodic features. Temporal characteristics of speech 
such as pauses and changes in speed often provide clues for the chunking of 
words into larger constituents, while stress indicates discourse organizing 
concepts such as the distinction between given and new information (Clark & 
Clark, 1977). The transition to text requires the development of alternate 
strategies to compensate for the disappearance of these features. 

Text does have some compensatory aspects, however. Punctuation
provides some of the clues for which prosodic features are useful. In contrast
with speech, segmentation of the message into words and sentences is
concretely indicated in written text and is not a task which must be performed
by the reader. In addition, certain devices which are used solely in text such as 
paragraphs, underlining and italicizing can help specify the larger structure of 
the message. Effective reading involves recognizing the function of these aids 
and developing processes to take the best advantage of them. 

Another characteristic of text which can be an asset in its comprehension 
is its permanence. Readers can use this fact by looking back over passages 
they have previously read, rereading a sentence which was misparsed the
first time around or rereading an entire paragraph whose point became clear
only at the last sentence. A major strategy a child must develop in making the
transition from oral to written language is a method for using the permanence
of text to compensate for some of its differences from speech.

2. INTERACTION — is the hearer/listener able to interact with the 
speaker/writer? Clearly, in a conversation, each participant has a chance to 
speak and often uses this opportunity to indicate that he or she has not 
understood the speaker. Thus, in a conversation which is "working," the 
hearer can verify his or her hypotheses quickly, making the maintenance of 
competing hypotheses less necessary. The parallel construction of 
hypotheses is a skill which we hypothesize children must learn in their
transition to reading stories. 

Being in a conversation also requires the listener to make an active 
attempt to understand what is being said in order to respond appropriately. In 
non-interactive media such as books and TV this impetus is absent. 
Participatory language experiences are, in addition, highly individualized; each
participant has some model of the other's beliefs and knowledge and
composes utterances taking this model into account. Thus, the language with
which a child comes into contact in conversations is more tailored to his or her
knowledge than the language in a multi-recipient object like a book could be.
Even in the best of situations, then, a child reading a book will encounter
unfamiliar material more often than in having a conversation with a parent or
peer.

3. INVOLVEMENT — is the communication directed to the reader/listener?
The inclusion of this dimension reflects the fact that certain language
experiences are directed toward the reader/listener, while in others, he or she
is essentially "eavesdropping." One clue to locating a language experience
along this dimension is the use of second person pronouns. An "involving"
communication will use "you" to refer to the reader/listener, sometimes even
in the imperative. If a "noninvolving" communication contains "you" at all,
the referent will be a character in the story, or a generalized person ("You
never know what's going to happen next."). Involvement in a communication
act usually implies that the writer/speaker knows who the reader/listener is;
consistent with this implication is the fact that most written communications of
this sort are derived from oral situations (e.g., letters).

4. SPATIAL COMMONALITY — do the speaker and listener (reader and
writer) share a spatial context? This dimension really comprises two different
questions. The first might be phrased: Can the participants see one another?
The second: Can the participants use the same spatial deictic terms because
they are in the same place?

The first question is primarily one of extra-linguistic communication.
Gestures, facial expressions and pointing can all be used to facilitate
communication. A nod of the head may denote agreement; a puzzled look may
communicate a lack of understanding, causing the speaker to restructure the
utterance. Pointing may aid in specifying referents for pronouns or noun
phrases such as "that dog over there."

The second aspect of spatial commonality has to do with the use of
deictic words such as "here," "there," "come," "go," etc. If the two
participants are in the same place, they can understand such words without
translating them to account for the other person's being in a different place.
The permanence of written language and the existence of modern
telecommunications have created situations in which the two participants can
be separated in space, thus making it necessary for the listener to interpret
spatial deictic terms in the speaker's context. In related work both Tanz (1976)
and E. Clark (1977) have noted that children's ability to understand spatial
deictic terms even in oral language is not complete when they enter school.

5. TEMPORAL COMMONALITY — do the participants share a temporal
context? This again is a deictic issue involving the use of such words as
"now," "today," "last Sunday," and verb tense markers. The correct
interpretation of such words when the participants are separated in time
requires the reader/listener to take the point of view of the speaker/writer. A
child's oral language experience does not often require this ability to switch
the temporal context of utterances. Although it is certainly possible for a
mother to address the following remark to her child: "Remember I told you
yesterday, 'You can go out to play tomorrow'.", it appears that this demand for
temporal context-switching is seldom imposed on a child in oral
conversations.

6. CONCRETENESS OF REFERENTS — are the objects and events referred
to visually present? Early conversations deal almost exclusively with concrete
objects which a child can see: Mommy, Daddy, clothes, food, or objects which
the child has at least seen previously and which therefore have some concrete
reality to him or her (Nelson, 1974). In reading or listening to stories, a child is
often required to make up an object or event given an incomplete verbal
description, a process which may take additional cognitive sophistication. The
child may also have to integrate several partial descriptions of the same object
and remember the composite description without the aid of an external
referent.

7. SEPARABILITY OF CHARACTERS — is the distinction between different
people's statements and points of view clearly indicated? In a normal
conversation, such distinctions are obvious, as each person makes his or her
own statements; each point of view has a physical "anchor." Even so, for a
young child, the parallel maintenance of several distinct points of view may be
confusing. In a book this problem is compounded, as the child must not only
"construct" the individuals involved (see concreteness of referents, above),
but must parcel out comments, feelings and motivations to each of them on
the basis of more subtle clues: punctuation, paragraph structure and
inferences based on some consistent model of each of the characters.

These seven medium dimensions, however, account for only some of the
differences among language experiences. Conversations and texts diverge
significantly in terms of the structure, function and topic of the message itself.
The structure of most conversations (short statements related as
question/answer or on a common topic) is a far cry from that of stories
(episodic structure, including setting, character introduction and action
(Rumelhart, 1975)). Differences in vocabulary and syntactic structure between
oral and written language have also been noted (Danks, 1977). In terms of
topic, many of a child's conversations center around concrete, familiar
objects; the transition to reading involves a move toward more abstract
subjects often unfamiliar to the reader. The function of reading also differs
dramatically from that of having a conversation. Children usually engage in
conversations to get information, cause an event or make social contact. The
function of texts tends more toward informing, persuading, or, in the case of
early-grade readers, teaching new words. The purpose of these texts often
escapes children. They demand, "Why should I read?"; interestingly enough,
they never ask, "Why should I talk?".

Although these dimensions have been identified and discussed by
contrasting two extremes — children's oral conversations and reading a story
— there are many language experiences which lie between the two. A
dimensionalization like the one presented here defines a space within which
language experiences may be compared and inspires a search for the
uninstantiated possibilities. We can think of each language experience to be
described as a point in 7-dimensional space. At first the space appears to be
only sparsely filled, but, in fact, we can come up with quite a few intermediate
points by teasing apart the dimensions listed above.

Take, for example, talking on the telephone, a language experience which is
very similar to a face-to-face conversation except for the lack of spatial
commonality between participants. This difference implies that a child talking
on the telephone faces the potential problem of incorrectly interpreting words
such as "here" because of the spatial context shift necessary to interpret the
word. An additional hindrance implied by the position of this experience on the
"spatial commonality" dimension is the lack of extralinguistic communication,
made impossible by the limited communicative medium. Objects referred to in
the conversation which are in the speaker's spatial context are probably not
immediately visible or accessible to the child. For a child who relies on these
aids to comprehend speech, their absence may necessitate additional
processing and/or lead to comprehension difficulties. The point is that this
additional processing is precisely the type which is necessary in reading
stories as well.

Another example is reading comics. Although this activity shares with
reading stories the act of decoding, it differs from it along several dimensions.
Characters in comic books are easy to identify because they are visually
represented and their words and actions are clearly attributed to them. The
multitude of pictures also makes it easier for a child to visualize referents
(animals, castles or dragons) and makes it less necessary for him or her to
integrate successive partial descriptions into a larger one, as the picture
provides many of the details at first glance. The structure of the text in comic
books tends to be conversational, so they may lie closer to certain oral
language experiences on the message dimensions (structure, function and
topic) as well.
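
The idea of language experiences as points in a seven-dimensional space can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the coding of face-to-face conversation as all zeros and story reading as all ones, and the particular coordinates given to the telephone and to comics, are our reading of the two examples above, not values supplied by the framework itself.

```python
from typing import Dict, List, Tuple

# Order follows the seven medium dimensions listed in the text.
DIMENSIONS: Tuple[str, ...] = (
    "modality",
    "interaction",
    "involvement",
    "spatial commonality",
    "temporal commonality",
    "concreteness of referents",
    "separability of characters",
)

# Assumption: 0 marks the value typical of face-to-face conversation,
# 1 marks the value typical of reading a story.
EXPERIENCES: Dict[str, Tuple[int, ...]] = {
    "conversation": (0, 0, 0, 0, 0, 0, 0),
    "reading a story": (1, 1, 1, 1, 1, 1, 1),
    # Telephone: like conversation except for spatial commonality.
    "telephone": (0, 0, 0, 1, 0, 0, 0),
    # Comics: like story reading, but referents are pictured and speech
    # is clearly attributed to characters (illustrative coding only).
    "comics": (1, 1, 1, 1, 1, 0, 0),
}


def differing_dimensions(a: str, b: str) -> List[str]:
    """Return the names of the dimensions on which two experiences differ."""
    return [
        name
        for name, x, y in zip(DIMENSIONS, EXPERIENCES[a], EXPERIENCES[b])
        if x != y
    ]


print(differing_dimensions("conversation", "telephone"))
print(differing_dimensions("reading a story", "comics"))
```

Listing the dimensions on which two experiences differ is one simple way to state which new demands an intermediate experience does and does not place on the child.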

The existence of this space of language experiences, then, allows us to
make hypotheses about some distinctions between oral conversations and
written text which may really make a difference to children learning to read. It
also provides a framework for identifying how other language experiences
might be related to conversing and reading, what cognitive demands they
share with each and how we might use them to ease the transition for children.

Characteristics of the cloze procedure as a research
tool in the study of language

The purpose of this paper is to demonstrate the value of the cloze procedure 
in developing comprehension measurements for both written and oral language 
research. It will be argued that the cloze procedure is uniquely qualified for this 
important function due to (a) its theoretical base in an information processing 
theory of learning and communication and (b) its superior psychometric potential 
for generating valid tests of language comprehension.

For the past twenty years, the writer has been collecting bibliographical
references and documents in which the cloze procedure has been used as a
research tool for studying a wide variety of language variables. At the present
time, approximately six hundred references have been gathered. Although the
major thrust of numerous papers on the cloze procedure delivered at annual
meetings of the National Reading Conference has been on the measurement of
reading comprehension and the readability of printed passages, there has been
a steady growth in the literature on many other facets of language, both written
and spoken. A rough count of titles in this collection indicates that studies of
language and cognitive variables constitute the largest single category in the
bibliography. In fact, the number of language studies using the cloze procedure
is twice as great as those references in which the measurement of reading
comprehension or readability is the major focus. Many language topics have
been studied, such as grammar, dialect, oral speech characteristics, foreign
language facility, listening, contextual constraints in prose, communication
effectiveness, drug effects on communication and cognitive processes, divergent
and convergent thinking, etc. As the psychology of reading has moved into the
mainstream of language and/or cognitive theory and research and out of the
shallow waters of educational methodology, the cloze procedure has served as a
powerful research tool in facilitating this rapprochement. This development has
no doubt been made possible by the fact that the cloze procedure was derived
from an information processing theory which is equally applicable to all forms
and modes of communication. The increasing use of this technique in language
investigations also stems from its superior psychometric characteristics.

Theoretical Foundations of
Cloze Measurements

It is axiomatic among measurement specialists that most measuring
instruments in psychology and education are not firmly rooted in theory. Reading
tests, for example, are usually based upon an empirical set of "skills"
stemming from traditional classroom usage. Skills to be measured are selected
more or less pragmatically in an effort to provide useful information for teachers
and clinicians. Like most tests, measures of language comprehension are largely
based upon the shifting sands of empiricism. It is the writer's opinion that
the cloze procedure produces superior language tests because it has, potentially,
better construct validity than conventional measurement procedures.
As we all know, the term "cloze" was coined by Wilson Taylor from the term
"closure" in gestalt psychological theory. However, Taylor did not attempt
initially to work out a careful and detailed theoretical analysis of the cloze
procedure in relation to the gestalt theory of perception and learning. Despite
the use of the term "cloze," Taylor's theoretical derivation of the cloze
procedure was essentially based upon a modification of a mathematical
information theory designed by Shannon and Weaver (1949). Two basic concepts in
this theory were "information" and "redundancy." Information was defined in
terms of how much uncertainty was reduced by the accurate reception of a
message and redundancy was defined in terms of information coming from
more than one source. It is important to note that the cloze procedure was
initially derived from a practical engineering strategy for measuring the flow of
information. This accounts both for its pragmatic appeal and its theoretical
significance to increasing numbers of scholars interested in language and
cognitive processes.

It remained for Smith (1971, 1975) to work out more fully a psycholinguistic
theory of comprehension which made extensive use of the concept of
information processing. It is the information processing aspect of Smith's rather
complex conceptualization of comprehension which is particularly relevant to a fuller
understanding of the construct validity of the cloze procedure as a measure
of information processing. Smith viewed individual differences in ability to
comprehend among readers or listeners in terms of their ability to reduce
uncertainty through the use of both distributional and sequential redundancy.
The message receiver continually makes inferences concerning the information
to be supplied by filling in gaps at choice points throughout a message.
Thus the process of inference is a basic component of all comprehension.
Filling in inferential gaps reduces uncertainty and thereby provides continuing
feedback as the comprehender actively participates in the communication
process. This theoretical portrayal of comprehension by a reader or listener as a
processor of information provides a solid psycholinguistic basis for using the
cloze procedure in constructing tests of language comprehension.

Let us consider, hypothetically, the process by which a cloze response might
be made in terms of the previously described theory of information processing,
assuming a high level of uncertainty at a cloze gap. In taking a cloze
test, as in all normal reading and listening, the individual reacts to information
from several sources (i.e., redundancy) in making inferences among words to be
considered for filling in gaps within a message. In the written or spoken sentence,
"The cat climbed ____ the tree in order to escape from the dog," the
comprehender would react to the redundant information stored within his
nervous system together with visual and/or auditory information in the message
in making a cloze response. Previous experience, either direct or vicarious,
about dogs chasing cats and cats climbing trees would provide a greater
source of information than that found on the page or in the spoken message. The
comprehender might react first to the sentence structure and eliminate such
possible words as nouns, verbs, adjectives, and adverbs as alternatives to be
considered within this context. By making use of sequential redundancy
concerning grammatical patterns, he would reduce considerable uncertainty
because he has only the category of function words to be considered as
possible choices. Next, the comprehender might make use of distributional
as well as sequential redundancy in making inferences. Among function
word alternatives, certain words are more likely to occur than others
independently of verbal sequence. For example, the function words up or down
are statistically more likely to be used than the function word throughout,
not considering their context. The skilled comprehender has both sequential
and distributional redundancy "rules" programmed in his nervous system.



Semantic clues would also suggest that the word "up" would be a better choice
than the word "down" as a more likely means of escape from the dog.
Other overlapping sources of information used in reducing uncertainty at the
cloze gap might be visual-orthographic and/or auditory-phonemic sequences.
The skillful comprehender unconsciously makes efficient use not only of multiple
sources of information but also of overlapping types of redundancy to reduce
uncertainty at cloze gaps randomly distributed throughout a message. In some
messages, the sophisticated message receiver could use elements of the
author's writing style and even aesthetic elements in making cloze inferences.
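
The successive reduction of uncertainty at a cloze gap can be made concrete with a small Python sketch that measures uncertainty as Shannon entropy in bits. All of the candidate words and weights below are hypothetical values chosen for illustration; they are not frequencies reported in the literature, and the two filtering steps are only a rough stand-in for the syntactic and semantic inferences described above.

```python
import math
from typing import Dict, Iterable


def entropy_bits(weights: Dict[str, float]) -> float:
    """Shannon entropy, in bits, of a weighted set of candidate words."""
    total = sum(weights.values())
    h = 0.0
    for w in weights.values():
        if w > 0:
            p = w / total
            h -= p * math.log2(p)
    return h


def restrict(weights: Dict[str, float], allowed: Iterable[str]) -> Dict[str, float]:
    """Keep only the candidates that survive a constraint."""
    allowed_set = set(allowed)
    return {word: w for word, w in weights.items() if word in allowed_set}


# Hypothetical candidates for the gap in "The cat climbed ____ the tree ...".
# Unequal weights stand in for distributional redundancy.
candidates = {
    "up": 4.0, "down": 2.0, "throughout": 1.0, "over": 1.0,
    "quickly": 2.0, "green": 2.0, "ran": 2.0, "ladder": 2.0,
}

# Sequential (grammatical) redundancy: only function words fit the slot.
function_words = restrict(candidates, ["up", "down", "throughout", "over"])

# Semantic clues: keep only words consistent with escaping the dog.
plausible = restrict(function_words, ["up", "over"])

for label, dist in [
    ("all candidates", candidates),
    ("function words only", function_words),
    ("semantically plausible", plausible),
]:
    print(f"{label}: {entropy_bits(dist):.3f} bits")
```

Each constraint removes candidates, so the computed uncertainty falls at every step, which is the sense in which overlapping sources of redundancy narrow the choice at a gap.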

It is important to realize that all readers and listeners carry out some such 
process of inference and redundancy utilization as previously described 
in comprehending messages. We only tend to become aware of this inferential 
process when there is a high level of uncertainty at a gap. The main differ- 
ence between taking a cloze test and reacting to a normal written or spoken 
message is that in the real world the reader or listener selects his own gaps in 
relation to his own background of information, his current motivational 
state, temporary fluctuations in his attention, etc. In the cloze message, the
gaps are arbitrarily chosen by the test constructor. However, any test situation
is necessarily somewhat artificial. The writer believes that this model of
information processing is valid and supported by a growing body of research
and theory. The "goodness of fit" between this model and the cloze procedure
is impressive and very significant to the serious language researcher. 
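
How a test constructor might impose gaps on a message can be sketched as follows. The sketch assumes a fixed-interval deletion rule (every n-th word) and exact-word scoring; both are common conventions adopted here for illustration, and the function names are hypothetical rather than taken from any published procedure.

```python
from typing import List, Tuple


def make_cloze(text: str, every: int = 5, blank: str = "_____") -> Tuple[str, List[str]]:
    """Replace every `every`-th word with a blank.

    Returns the gapped text and the list of deleted words (the answer key).
    Words are split on whitespace, so attached punctuation is deleted too.
    """
    words = text.split()
    answers: List[str] = []
    for i in range(every - 1, len(words), every):
        answers.append(words[i])
        words[i] = blank
    return " ".join(words), answers


def score_exact(responses: List[str], answers: List[str]) -> float:
    """Proportion of gaps filled with exactly the deleted word (case-insensitive)."""
    if not answers:
        return 0.0
    hits = sum(
        r.strip().lower() == a.strip().lower()
        for r, a in zip(responses, answers)
    )
    return hits / len(answers)


passage = "The cat climbed up the tree in order to escape from the dog"
test, key = make_cloze(passage, every=4)
print(test)
print(key)
print(score_exact(["up", "order", "a"], key))
```

Because the deletion rule ignores the content of the message, the gaps fall where the rule puts them rather than where a particular reader would experience uncertainty, which is the artificiality noted above.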

Unique Measurement Characteristics 
for Language Studies 

Let us consider characteristics of cloze measurements from the standpoint
of measurement theory. At first blush, a cloze measuring instrument for
assessing comprehension is not very convincing. The naive observer might ask,
"How could anyone reasonably be expected to fill in all of those blanks?"
After all, a cloze test looks (or sounds) much like the type of fill-in-the-blank
exercises used to measure rote learning in many workbooks. Moreover,
sophisticated critics have maintained that, on its face, a cloze test cannot
conceivably measure comprehension directly. It is charged that if cloze tests
correlate highly with other comprehension measurements, then surely this must
be due to the fact that both the cloze and the comprehension tests are
measuring some common variable.

Now there is no doubt that the cloze procedure produces tests which are lacking
in face validity. A cloze test does not look or sound like a normal communication,
but the previous analysis of comprehension in terms of information processing
theory should be sufficient to dispense with the reactions of the naive
observer to a cloze measurement. However, the criticism that the cloze test
does not measure comprehension directly but measures some other variable
which is correlated with comprehension must be taken more seriously.

It has been asserted that the correlation between cloze measurements and 
other comprehension measurements is due to the fact that both are measuring a 
general verbal competency. There is no doubt some substance to this
contention. High correlations are usually found between cloze test scores and
measures of verbal aptitude. However, this is also true of other comprehension
tests. In any event, the correlations between both cloze test results and
conventional comprehension results on the one hand, and verbal aptitude tests
on the other, do not account for a large amount of the variance in common
to both distributions. Therefore, cloze tests measure something more than
verbal aptitude.
A more serious criticism has been made by Carroll (1972) to the effect that
cloze scores are largely influenced by linguistic clues in the immediate
context around the missing word and that, therefore, cloze scores do not assess
the ability to comprehend major ideas in a message. If this were the case, the
use of cloze tests as comprehension measures per se would be questionable
indeed. This criticism, together with a finding by MacGinitie (1961) to the effect
that context of more than five words around a blank in a printed cloze test
does not help in making the correct response, has been widely quoted. However,
it should be noted that MacGinitie also pointed out that constraints may operate
over longer distances, since knowledge of the topic might have an influence
which could extend beyond local contextual boundaries. The latter statement is
important to keep in mind. Since the cloze procedure measures language
correspondence between a message source and receiver, redundancy resides
in both syntactic and semantic information within the shared language
correspondence. This is obviously not restricted by localized visual or auditory

Studies by Darnell (1963), Ramanauskas (1971), and others tend to show that
cloze responses are indeed sensitive to longer range constraints than reported
by MacGinitie. These studies show that scrambling sentence order produces a
significant influence on cloze responses within sentences. This could hardly be
the case if cloze tests measure only "local redundancy." This issue is not by
any means "clozed," but the writer is convinced that the evidence does not at this
time give strong support to Carroll's position.

From a theoretical point of view, a good case can be made for the
contention that cloze measurements do in fact measure comprehension more
"directly" than conventional comprehension tests. Five points will be made to
support this contention. First, cloze tests are intrinsic measures of the
effectiveness of communication by sampling the degree of language correspondence
between a message source and a receiver. Substantially the same results are
obtained whether the scoring is done by exact word method or by the synonym
method. This could hardly be the case if comprehension of the communication
were not being tapped directly. Second, cloze tests measure comprehension
in process, not comprehension as a product after the fact. Answering large
numbers of questions after the communication has been received (as in the
conventional comprehension test) is not as direct a measure of the communication
in process as can be obtained by the cloze test. Third, all cloze tests are
based upon the basic psycholinguistic process of inference which is intrinsic to
all communication. We avoid overloading our short-term memory by tuning in
and out selectively and filling in the gaps in both oral and written
communications. It is precisely this process that is tapped by all cloze items in
varying degrees. This cannot be said of conventional comprehension measurements.
Fourth, cloze tests sample more or less randomly the choice points for
predictability within a message. What other comprehension test can attain such
unbiased item samples from a universe? Fifth, unlike any other communication
test format capable of measuring higher level thought processes, cloze item
writing lends itself to precise replication by independent writers. Although all
comprehension tests impose some degree of artificiality upon the message
receiver in the measurement process, the intimate relationship between language
and learning theory and cloze measurement provides a more direct and natural
testing situation, in many ways, than provided by conventional comprehension
tests.

Cloze measurements have been constructed and interpreted in the tradition
of what has been called "classical test theory." As such, they have been
designed to yield maximum variance and reliability and have received a
norm-referenced interpretation. Within this theoretical framework, the cloze
procedure produces excellent measuring instruments. If the goal of comprehension
measurement is to obtain reliable and valid measures of even small differences
in comprehension among people, cloze tests are most useful for this purpose
as indicated by numerous studies of criterion related validity. The high
correlations among cloze comprehension measurements for different materials
written at various levels of difficulty indicate that these measurements are
tapping comprehension in a very general or global sense.

Conventional cloze tests also permit a type of criterion-referenced
interpretation, provided that it can be demonstrated empirically that people who
fail to score above a given cut-off score cannot perform designated language tasks
at a satisfactory level of competence. It should be said, however, that cloze
comprehension tests do not provide criterion-referenced measurement in the
sense of a test which can be interpreted in terms of mastery or non-mastery
of specifically defined behavioral or instructional objectives. A cloze
test has proven to be a superior measure of general comprehension but does
not lend itself easily to the measurement of specifically defined language
comprehension processes.

In an interesting theoretical paper on reading comprehension, Simons (1971)
states, "The cloze test is a better measure of reading comprehension than
traditional tests because it appears to measure fewer of the extraneous aspects
of student functioning enumerated above. (sic) Specifically, it does not have
questions and therefore is not measuring a student's skill in understanding
questions. It is not a memory test because a student can continually read and
examine the passage. It also does not appear to be measuring a student's
familiarity with the content of the passage, at least to the degree that
traditional tests do. The mechanical procedure for developing the test (the
deletion of every nth word) renders it more objective and less subject to the
arbitrary judgments of the test constructor than traditional comprehension tests."
Although Simons concludes that the cloze test of reading comprehension lacks
construct validity for the same reason that all tests of reading comprehension do
(the absence of a theory of the comprehension process), the writer believes
that the subsequent elaboration of a psycholinguistically oriented information
processing theory of comprehension by Smith provides a potentially stronger
construct validity for cloze tests as measures of language comprehension
than is characteristic of other comprehension tests at the present time. Empirical
studies determining the construct validity of cloze tests by testing hypotheses
derived from information processing theory would be desirable. Such studies
would, hopefully, provide still stronger evidence for the value of cloze
measurements in language research.
The relation of comprehension to semantic and syntactic
language cues utilized during oral and silent reading



Within the past twenty years, cloze tests have been analyzed using a variety
of deletion patterns (Rankin, 1959; Weaver & Kingston, 1962; Ohnmacht,
Weaver, & Kohler, 1970; Ohnmacht & Fleming, 1972; Greene, 1965; Louthan,
1965), scoring procedures (Bormuth, 1965; Vaughn, Tierney, & Alpert, 1977),
and research designs and statistical analyses (Weaver & Kingston, 1962;
Bormuth, 1969; Ohnmacht & Fleming, 1970; Horton, 1975), in an attempt to
describe what construct cloze measures. With the exception of the Weaver
and Kingston (1962) study, research has suggested that cloze is related to a
construct or constructs identified as comprehension and/or the ability to deal
with the structures of language. Since research with the cloze procedure has
been inconsistent in type of deletion patterns, text, age of subjects, and
criteria, a sound empirical description or theoretical framework of the process
required of an individual to complete a cloze test, and consequently what is
measured by cloze, has not been clearly delineated. Although variations of
deletion patterns are quite common in cloze research, the scoring procedure
based on the percent of correct (exact word or acceptable synonyms)
responses has dominated cloze research. If it is the format which first
determines the validity of cloze as an assessment instrument, then analysis of
a cloze test should utilize more fully the information this format can yield; that
is, cloze should be considered a medium through which one can evaluate
those language processes related to the reading and completion of a cloze
task. A qualitative analysis of cloze responses would seem to provide an
in-depth view of the language abilities utilized in the reading of a cloze passage.

Relying upon Goodman's (1969, 1972, 1973) view of reading as a
psycholinguistic process, it is then possible to see that the cloze procedure
can provide "little windows" to the language-based strategies a reader uses
to read a passage. Accepting the view that reading is "one of the four
language processes" (Goodman, 1976a, p. 89), it thus seems sound to
approach analysis of both oral and silent reading in terms of the elements of
language which are integral to reading ability. According to Goodman, reading
is a "spiral of predict, sample, select, guess and confirm activities" (1973, p.
295). One may thus infer that comprehending is also a continuous and
simultaneous part of this activity. If reading is accepted as a language process,
the accusation that performance tests of reading do not reflect the reading
process in relation to a model of language competence (Roush, 1976) should
be examined.

At least two researchers have hinted at the feasibility and utility of coupling 
miscue analysis and the cloze procedure. Brown (1968) suggested that a 
scoring procedure which would allow a qualitative decision about the words
used to complete a blank be applied to the cloze procedure. Page (1974) saw
research combining the cloze procedure and miscue analysis as promising 
"extensive contributions to understanding the reading process" (p. 168). 



Combining some of the questions raised by cloze research with the findings 
and theories of miscue analysis, this study attempted to examine the relation 
of cloze, comprehension, silent and oral reading. Miscue analysis was used to
quantify syntactic and semantic language cues utilized during oral and silent 
reading tasks. Specifically, the questions examined in this study were: 

1) How do cloze syntactic and semantic mean change scores relate to 
total cloze test scores? 

2) How do cloze test scores and oral reading relate to a standardized test
of reading comprehension?

3) How do semantic and syntactic mean change scores for content
words and function words relate to literal and inferential
comprehension?

Method 

Subjects 

The subjects for this study were 68 seventh-grade students drawn from a 
sample identified as reading at least one year but not more than four years 
below grade level, according to the Stanford Achievement Test. The subjects 
attended a middle school in a rural area of northeast Georgia. 




Procedure

The Reading Comprehension subtest of the Stanford Diagnostic Reading
Test, Level II, Form W (SDRT) (Karlsen, Madden, & Gardner, 1966), was used
as a criterion measure of literal and inferential comprehension.

A fifth word deletion cloze test was constructed from a story written by the
researcher. The story was in a narrative style, told through the main character.
No intentional controls were placed on syntax. There appeared to be a
balance among simple, compound, and complex sentence structures. The plot
of the story concerned a girl's attempt to establish a co-educational track
team at her junior high school. The readability of the passage was estimated at
upper seventh-grade according to the Readability Graph (Fry, 1968) and
seventh-grade according to the Flesch Reading Ease Formula (Burmeister,
1974).
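The mechanical nature of an nth word deletion can be illustrated with a short
sketch. This is only an illustration under stated assumptions: the function
name, the blank marker, and the sample sentence are hypothetical and are not
taken from the study's actual passage, and punctuation is left attached to
words for simplicity.

```python
def make_cloze(text, n=5, blank="_____"):
    """Replace every n-th word with a blank.

    Returns the mutilated passage and the list of deleted words (the key).
    Words are split on whitespace only; this is a simplification.
    """
    words = text.split()
    answers = []
    for i in range(n - 1, len(words), n):
        answers.append(words[i])   # keep the deleted word for scoring
        words[i] = blank           # leave a gap in its place
    return " ".join(words), answers


# Hypothetical sample sentence, not the researcher's story.
passage = ("She asked the coach if girls could try out "
           "for the track team this spring")
cloze_text, key = make_cloze(passage, n=5)
print(cloze_text)
print(key)
```

Because the deletion rule is fixed in advance, two people applying it to the
same passage would produce the same test, which is the sense in which the
procedure is said to be objective.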

After the cloze test was constructed, the words deleted were divided into
two categories: content and function words. Of the 107 total deletions, 69
were content words and 38 were function words. The KR-21 reliability estimate
of this cloze test was .73.
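For readers unfamiliar with the KR-21 estimate, the standard Kuder-Richardson
formula 21 needs only the number of items, the mean total score, and the
variance of total scores. The sketch below assumes that standard formula; the
mean and variance used in the example are hypothetical, since the paper
reports only the number of deletions (107) and the resulting coefficient.

```python
def kr21(k, mean, variance):
    """Kuder-Richardson formula 21 reliability estimate.

    k        -- number of items (here, cloze deletions)
    mean     -- mean total score across examinees
    variance -- variance of total scores across examinees
    """
    return (k / (k - 1)) * (1 - mean * (k - mean) / (k * variance))


# k = 107 comes from the text; the mean and variance are made-up values
# chosen only to show the arithmetic.
estimate = kr21(k=107, mean=40.0, variance=100.0)
print(round(estimate, 3))
```

KR-21 treats all items as equally difficult, so it tends to give a somewhat
conservative estimate when item difficulties vary.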

The cloze test was scored using three different procedures. First, the total
cloze score was based on the number of exact word replacements. Second,
the scoring procedures described in the Reading Miscue Inventory (RMI)
(Goodman & Burke, 1972) were used in order to yield grammatical
acceptability and strength of comprehension scores. The third procedure was
from The Goodman Taxonomy of Reading Miscues (Goodman, 1976b). The
"Taxonomy" was used to calculate semantic and syntactic mean change
scores. These were used to analyze the degree of syntactic and semantic
change for all miscues which were identified as grammatically and
semantically acceptable in the RMI scoring procedure.

The third instrument was an oral reading passage which was analyzed
according to procedures described in the RMI and The Goodman Taxonomy
of Reading Miscues. Scores indicating grammatical acceptability, strength of
comprehension, and syntactic and semantic mean change were obtained for
the oral reading of the passage. The passage was the intact version of the
cloze test. It was administered to the subjects according to the procedures
described in the RMI manual.

The SDRT was administered first, followed by the cloze test. The individual 
administration of the oral reading passage followed one week after the 
administration of the cloze test. 

Analysis

Pearson product-moment correlations and a forward inclusion step-wise
multiple regression were used to examine the relations among the identified
variables. A t-test for the significant difference between correlations was used
to determine if syntactic and semantic mean change scores differentially
predicted total cloze scores.
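The paper does not name the particular t-test used to compare the two
correlations. One common choice when both correlations share a variable and
come from the same sample is Hotelling's t, which has n - 3 degrees of
freedom. The sketch below assumes that test. The correlation between the two
predictors (r_yz) is not reported in the paper, so the value used here is
purely hypothetical.

```python
import math


def hotelling_t(r_xy, r_xz, r_yz, n):
    """Hotelling's t for comparing two dependent correlations.

    Tests H0: corr(x, y) == corr(x, z), where all three variables are
    measured on the same n cases. Returns (t, degrees_of_freedom).
    """
    # Determinant of the 3 x 3 correlation matrix of x, y, and z.
    det = 1 - r_xy**2 - r_xz**2 - r_yz**2 + 2 * r_xy * r_xz * r_yz
    t = (r_xy - r_xz) * math.sqrt((n - 3) * (1 + r_yz) / (2 * det))
    return t, n - 3


# x = total cloze score, y = syntactic mean change, z = semantic mean change.
# r_yz = 0.50 is an assumed value for illustration only.
t_value, df = hotelling_t(r_xy=0.31, r_xz=0.28, r_yz=0.50, n=68)
print(round(t_value, 3), df)
```

With 68 subjects the test has 65 degrees of freedom, and two correlations as
close as .31 and .28 yield a small t under any plausible value of r_yz.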

Results 

Significant but small relations were found between cloze syntactic mean
change scores (r = .31, p < .01) and semantic mean change scores (r = .28,
p < .05) and total cloze scores. These scores did not differentially predict total
cloze scores, t (65) = .314, p > .05.

A significant, moderate relation was found between cloze comprehension 
scores and the total comprehension scores of the SDRT (r = .56, p < .01). No 
stable relation (r = -.19, p > .05) was found between oral comprehension
scores and the total comprehension scores of the SDRT. 
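Correlations of this size are easier to interpret as proportions of shared
variance, obtained by squaring r. The following sketch simply performs that
arithmetic on the coefficients reported in this section; the dictionary
labels are descriptive names chosen here, not the authors' variable names.

```python
def percent_shared_variance(r):
    """Percent of variance two measures share, given their correlation r."""
    return 100 * r * r


# Coefficients as reported in the Results section.
reported = {
    "cloze syntactic change vs. total cloze": 0.31,
    "cloze semantic change vs. total cloze": 0.28,
    "cloze comprehension vs. SDRT": 0.56,
    "oral comprehension vs. SDRT": -0.19,
}
for label, r in reported.items():
    print(f"{label}: r = {r:+.2f}, shared variance = "
          f"{percent_shared_variance(r):.0f}%")
```

On this reading, a correlation of .56 corresponds to roughly 31 percent of
shared variance, and a correlation of .28 to roughly 8 percent.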

The results of the forward inclusion stepwise multiple regression for
syntactic and semantic mean change scores as predictors of literal
comprehension are reported in Table 1. There was a significant, moderate
relation between the first two syntactic predictors and literal comprehension.
The resulting multiple R for these two predictors was .445; the R' of .415
indicated that this is a good estimate of the population correlation. Predictors
three and four together added only two percent in explained variation to literal
comprehension. There was a significant, moderate relation between the two
predictors, cloze semantic mean change scores for content words and oral
semantic mean change scores for function words, and literal comprehension.
The resulting multiple R containing these two predictors was .497, and the R'
of .473 indicates that this is a good estimate of the population correlation. The
remaining predictors together added only three percent in explained variation
to literal comprehension.
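The paper does not state how F or R' was computed. The sketch below assumes
the usual overall F test for a multiple correlation and the usual shrinkage
(adjusted R squared) correction, and takes n = 68 from the Subjects section on
the assumption that no cases were dropped. Under those assumptions the
formulas give values close to those reported in Table 1, with small
differences that are plausibly due to rounding of R.

```python
import math


def f_from_multiple_r(multiple_r, k, n):
    """Overall F for a regression with k predictors and n cases."""
    r2 = multiple_r ** 2
    return (r2 / k) / ((1 - r2) / (n - k - 1))


def shrunken_r(multiple_r, k, n):
    """Multiple R corrected for shrinkage (sqrt of adjusted R squared)."""
    adjusted_r2 = 1 - (1 - multiple_r ** 2) * (n - 1) / (n - k - 1)
    return math.sqrt(max(adjusted_r2, 0.0))


N_SUBJECTS = 68  # from the Subjects section; assumes no missing cases

# (label, multiple R after two predictors, number of predictors)
steps = [
    ("syntactic predictors", 0.445, 2),
    ("semantic predictors", 0.497, 2),
]
for label, r, k in steps:
    print(label,
          "F =", round(f_from_multiple_r(r, k, N_SUBJECTS), 2),
          "shrunken R =", round(shrunken_r(r, k, N_SUBJECTS), 3))
```

The small gap between R and the shrunken R reflects the favorable ratio of
subjects to predictors at these early steps of the regression.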

The results of the forward inclusion stepwise multiple regression for
syntactic and semantic mean change scores as predictors of inferential
comprehension are reported in Table 2. There was a significant but small
relation (R = .246) between oral semantic mean change scores for function
words and inferential comprehension. The remaining two predictors did not
make a significant contribution to the explained variation in inferential 
comprehension. The fourth predictor did not meet the F-test for inclusion in 
the regression equation. 

It appears that there is no significant relation between the syntactic 
predictors and inferential comprehension. 

Discussion 

Since cloze syntactic and semantic mean change scores did not differentially
predict total cloze scores, this may indicate that the subject's ability to provide
responses which are syntactically similar to an expected response in a cloze 
blank does not provide a more reliable basis for predicting total cloze scores 
Table 1

Summary of the Stepwise Regression Analysis of
Predictors of Literal Comprehension

Predictor Variable                                   Multiple R        F

Cloze Syntactic Function Word Mean Change Scores        .394        12.097*
Cloze Syntactic Content Word Mean Change Scores         .445         8.022*
Oral Syntactic Content Word Mean Change Scores          .464         5.843*
Oral Syntactic Function Word Mean Change Scores         .466         4.350*

Cloze Semantic Content Word Mean Change Scores          .411        13.446*
Oral Semantic Function Word Mean Change Scores          .497        10.667*
Cloze Semantic Function Word Mean Change Scores         .518         7.838*
Oral Semantic Content Word Mean Change Scores           .522         5.887*

*p < .01

than does the ability to provide semantically similar responses. Thus it seems 
that a fifth word deletion cloze test can be used to analyze an individual's
ability to utilize both syntactic and semantic language cues in the completion
of the cloze task. It appears that these data support Weaver's (1965) view that
both syntactic and semantic constraints in the cloze procedure affect both
structural and lexical language elements. The correlations in this study
accounted for very little of the variation in total cloze scores, but that variation
is being accounted for totally by the miscues that were either syntactically or
syntactically and semantically acceptable to the entire passage. It is likely that
a scoring procedure that would take into account the correct replacements
would help contribute to the variance accounted for by the miscues in this 
analysis. Brown's (1968) view that the use of synonym scoring in cloze 
probably measures the reader's full language competence appears now to be 
somewhat simplistic. Semantic mean change scores accounted for only eight 
percent of the variation in total cloze. It does not seem likely that this scoring 
Table 2

Summary of the Stepwise Regression Analysis of
Predictors of Inferential Comprehension

Predictor Variable                                   Multiple R        F

Oral Semantic Function Word Mean Change Scores          .246         4.258*
Cloze Semantic Content Word Mean Change Scores          .293         3.044
Oral Semantic Content Word Mean Change Scores           .338         2.748

Oral Syntactic Content Word Mean Change Scores          .207         2.965
Cloze Syntactic Function Word Mean Change Scores        .272         2.596
Cloze Syntactic Content Word Mean Change Scores         .286         1.898
Oral Syntactic Function Word Mean Change Scores         .288         1.426

*p < .05

procedure alone accounts for total language competence. Certainly, degrees 
of semantic acceptability along with syntactic acceptability and the ability of 
the reader to recognize language inconsistencies and correct them enter into
the evaluation of language competence. 

In order to discuss the relation between cloze comprehension scores and
the total scores on the SDRT, it must be noted that comprehension scores for
this analysis included only those miscue patterns involving correction,
semantic acceptability and meaning change which resulted in the miscue
being assigned to the category of "no loss of comprehension." Since cloze
miscues which indicated no loss of comprehension accounted for
approximately 31 percent of the variation in SDRT scores, this finding seems
to support the use of the cloze procedure combined with the miscue analysis
to evaluate the reading process. The moderate relation of the cloze miscues to
the comprehension subtest leads to the inference that the production of a
miscue which is syntactically and semantically acceptable and with little or no
loss in meaning is, at least partially, accounting for comprehension ability.
Looking toward the future of cloze as a silent diagnostic test, it would be
necessary to examine the role that retelling of the passage after the completion
of a cloze test would play. What appeared to cloud the issue was failure to
establish a stable relation between the oral strength of comprehension scores
and the comprehension subtest of the SDRT. Acceptance of the concept that
silent and oral reading processes are similar would lead one to expect that oral
reading would also have some relation to comprehension. This discrepancy is
not considered conclusive, but rather an area for further research. This finding
appears to provide further empirical support for use of a retelling to evaluate
comprehension following an oral assessment.

Findings concerning the ability of cloze and oral syntactic and semantic
mean change scores for content and function words to predict comprehension
did not reveal one test or cue system to consistently predict literal or
inferential comprehension. Cloze syntactic mean change scores for function
and content words were better predictors of literal comprehension than their
oral reading counterparts. For semantic based language cues, cloze semantic
mean change scores for content words were the best predictor of literal
comprehension, while oral semantic mean change scores for function words
were the second best predictor. The single best predictor of inferential
comprehension was oral semantic mean change scores for function words.
The ability to perceive interrelationships between ideas (i.e., inferential
comprehension) may be based on competence in dealing with structural
meaning (Rankin, 1959). The ability to use this competence in oral reading
appears to override the ability to use structural meaning in cloze when viewed
as a predictor of inferential comprehension. However, given the mutilated
characteristic of the cloze passage, it is probably more difficult for the reader
to perceive overall interrelationships among the words, sentences, and ideas
in the cloze passage. There may not be enough information available to the
reader to allow the production of miscues that keep the basic relations in the
passage consistent with what was actually written by the author.

The findings of this study are not completely consistent with findings by
Rankin (1959). The cloze test used in the present study was a fifth (nth) word
deletion. Rankin found that nth word deletion cloze tests were highly related to
a test of inferential comprehension. He concluded that an individual must rely
more on understanding the interrelationships between ideas in order to
complete an nth word deletion cloze test. His conclusion does not explain why,
in the present study, the ability to use structural language cues in a cloze test
predicted literal comprehension but not inferential comprehension. It is
possible that the ability of an individual to utilize syntactic language cues in a
cloze test may play a more important role in the analysis of literal reading
comprehension skill than previously recognized. The nth word deletion cloze
test may be the means through which diagnostic data related to literal
comprehension can be obtained.
Assessing reading performance at the secondary level
through the utilization of a cognitive self-rating scale

While research has investigated the relationship between self-perception 
and general academic performance, there is a paucity of research exploring 
the relationship between one's self-perception and his/her reading ability. 
Research (Peters & Peters, 1976; Peters, 1977) investigating this topic has
revealed that when the score obtained from a cognitive self-rating scale is
combined with the score from a general measure of reading performance, the
ability to predict reading performance was improved when compared to a
method which utilizes only a general measure of reading performance.

Since results may vary with the type of comprehension probe utilized (e.g.,
general vs. specific reading performance, Artley, 1944; Maney, 1958; Peters,
Peters, & Kaufman, 1975; Sochor, 1958), this study sought to ascertain whether
a Cognitive Self-Rating Scale (SRS) would be a valid and reliable measure of
reading performance when compared with the results obtained from an
instrument designed to measure specific reading skills associated with one
cognate area. Therefore, the purpose of this study was to investigate the
following questions: (1) Does the SRS measure reading ability? (2) If the SRS is
a measure of reading ability, does it distinguish between various levels of
reading ability (good and poor)? (3) Is the SRS capable of measuring distinct
dimensions of the reading process? (4) Does the SRS predict reading
performance?

Method 

Subjects 

The sample was comprised of 59 students from a suburban junior high 
school located in an upper middle class community. The students were 
enrolled in a regular eighth grade English class. 

Reading ability was determined on the basis of individual performance on a 
Content Specific Reading Test (CSRT). Good readers were defined as those
students who had a range in raw scores from 35 to 50 while the poor readers
were defined as those students who had a range in raw scores from 10 to 25.
Students whose raw scores ranged from one to nine were eliminated, because
they were considered to be disabled readers who lacked sufficient skills to
read correctly the items on the SRS. Those students whose raw scores ranged
from 26 to 34 were eliminated to prevent a potential overlap between the good
and poor reader categories. There was a possible raw score of 50 on the
CSRT. Reliability (.92) and content validity for the CSRT were established in a
previous study (Peters, Peters, & Kaufman, 1975).

Materials 

If the SRS is to be predicated upon a measurable construct, then it must be
comprised of those factors research indicates are related to the reading
process. Based upon this premise, the SRS is composed of four factors
research has shown are related to the comprehension process: knowledge of
concepts, semantic structure of material, cognitive skills associated with the
comprehension process, and time required to process printed information.
Statements reflecting these factors were incorporated into the 15 item SRS
(see Table 1).

Table 1
Cognitive Self-Rating Scale

1. When compared to other members of the class, I would rate myself as
a very good reader.

2. I feel that most of the material we read for school is too easy.

3. It takes me a great deal of time to read most of my assignments,
because I am a slow reader.

4. I do not read most of the material assigned, because it is too hard.

5. If I selected the material for my classes, I would pick books that are
easier to understand.

6. When I complete my reading, I have no difficulty answering the
questions asked by my teachers.

7. Reading school materials is difficult for me.

8. When compared to other members of the class, I would rate myself as
a poor reader.

9. Most of the material I read in school contains many words that are
difficult to understand.

10. I have very little trouble understanding what I read in my classes at
school.

11. Material I read in school is easy.

12. I usually do not understand what is happening in the material I read for
school.

13. When I read material for school, there are very few words I do not
understand.

14. When compared to other members of the class, I would rate myself as
an average reader.

15. I have to read material assigned in school over and over to understand
it.



The determination of reading ability was based upon students' responses to
a four point Likert scale. A value of one was assigned to each response
considered to be indicative of good reading behavior. Good readers were
defined as those students whose scores ranged from zero to seven. An
internal consistency coefficient of .76 for the total scale was determined by
the K-R 21 formula.

Design 

Five statistical procedures were utilized to analyze the data. First, two
correlational matrices were utilized to analyze the relationship between the
CSRT and SRS and the interrelationship between items on the SRS. Second, a
factor analysis (varimax rotated) was utilized to ascertain whether the SRS
was comprised of distinct elements. Third, a t test was utilized to ascertain
whether the SRS discriminated between good and poor readers. Fourth, a
regression analysis was used to determine the predictive validity of the SRS,
and fifth, a crosstabulation of the results from the CSRT and the SRS was
performed to ascertain the accuracy of placement by reading ability.

Results 

To determine whether the SRS is a measure of reading behavior, it was 
correlated with CSRT. The results of that comparison reveal that the SRS does 
measure some dimension of the reading process. The correlation between the 
two instruments is moderately low (r = .352). While the SRS and the CSRT are 
related, they are measuring significantly different (p < .01) dimensions of the 
reading process. 

Since the SRS is a measure of reading behavior, did it measure distinct
elements within the reading process? A factor analysis reveals that the SRS
contained five factors with eigenvalues above 1.0. These factors account for
67 percent of the variance. To facilitate the identification of the items in each
of the five groups, a second factor analysis using the varimax rotation process
was performed. The items grouped in each of these five categories were then
compared to the original groupings. The results of this comparison reveal that
there is no consistent match between the items grouped in the four original
categories and the items in the five categories identified in the factor analysis.
While the items within the original four categories do not seem to be distinctive
groups, the SRS does appear to be comprised of five separate factors. The
stability of these groupings is confirmed when the intercorrelational
coefficients are examined. All items in each of the five groups were
moderately correlated.

The analysis of the results from the t test indicates that the SRS does
discriminate between good and poor readers (see Table 2). The difference is
significant, t (57) = -2.59, p < .01, means: 20.72 and 16.87. However, when
the results are analyzed on a subtest basis (contextual analysis, main ideas,
details, sequence, and critical reading), only two subtests discriminate
significantly between good and poor readers: main idea, t (57) = -2.21, p <
.05, and sequence, t (57) = -2.05, p < .03.

While the SRS is capable of discriminating between good and poor readers, 
the regression analysis of pre-CSRT scores and post-SRS scores reveals that 
the SRS is not a good predictor of general reading performance, F (5, 53) =
2.05, p < .052, since the total variance accounted for was approximately 16
percent. However, when the crosstabulation of placement by reading ability 
on the SRS and the CSRT is examined, it reveals that the two instruments are 
extremely consistent in their placement of poor readers. Only 1.7 percent of 
the students classified as poor readers by the CSRT were classified as good 



Table 2

Means of Good and Poor Readers on the CSRT

Subtest               Ability Group     Mean     S.D.

Contextual Analysis   Poor               3.25    1.38
                      Good               3.94    1.33

Main Idea             Poor               4.25    1.29
                      Good               5.07    0.93

Details               Poor               3.25    1.75
                      Good               4.11    1.21

Sequence              Poor               2.0     0.92
                      Good               2.88    1.16

Critical Reading      Poor               4.12
                      Good               4.70    1.55

Total Score           Poor              16.87    4.01
                      Good              20.72    3.89



readers by the SRS. The converse is true of good readers since 42.4 percent of
those students classified as poor by the CSRT were classified as good readers
by the SRS.
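
The crosstabulation of placements described here can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the label strings are hypothetical, and the source does not say whether its percentages are of the whole sample or of one instrument's category, so both are shown.

```python
from collections import Counter


def crosstab(csrt_labels, srs_labels):
    """Count students in each (CSRT placement, SRS placement) cell."""
    if len(csrt_labels) != len(srs_labels):
        raise ValueError("both instruments must place the same students")
    return Counter(zip(csrt_labels, srs_labels))


def percent_of_total(table, csrt_label, srs_label):
    """Cell count as a percentage of the whole sample."""
    total = sum(table.values())
    return 100.0 * table[(csrt_label, srs_label)] / total


def percent_of_row(table, csrt_label, srs_label):
    """Cell count as a percentage of all students given csrt_label."""
    row_total = sum(n for (c, _), n in table.items() if c == csrt_label)
    return 100.0 * table[(csrt_label, srs_label)] / row_total
```

With placements for each student in two parallel lists, `crosstab` gives the four cell counts and the two percentage helpers show how the same cell can yield different figures depending on the base chosen.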

Conclusions and Implications 

The purpose of this study was to determine the utility of an instrument
designed to measure students' self-perception of their reading ability. As the
results of the correlation between the SRS and the CSRT indicate, the SRS is
related to reading ability. This finding is consistent with previous research
(Peters, 1977), but more importantly, the SRS is a measure of both general and
specific reading ability.

As the results presented in Table 2 reveal, the SRS also has the capability of
discriminating between good and poor readers. While the SRS does have this
discriminative ability, the percentage of placement of good and poor readers
varies from instrument to instrument. The largest discrepancy exists between
those readers classified as poor by the CSRT and good by the SRS; forty-two
percent of the readers fall into this category. This discrepancy in placement
can be partially explained by the fact that the CSRT is too inclusive in its
placement of poor readers. It classified 54 percent of the sample as poor
readers. Given the educational composition of the population, it is highly
unlikely that such a large percentage of the group would be poor readers.
Restructuring the parameters used to define poor readers might produce
different results.

However, there is a high degree of consistency in the placement of poor
readers by both instruments. Only 1.7 percent of the students classified as
poor readers by the CSRT were classified as good readers by the SRS. When
the total number of poor readers placed by the SRS, 12 percent, is compared
with the percentage of poor readers placed by the CSRT, the SRS figure seems
more realistic when one considers the average distribution of poor readers in
a representative sample.

Since the SRS appears to be an accurate predictor of poor readers, one
possible function of the instrument might be as an initial screening device. For
example, content teachers who are confronted with large numbers of students
might find the SRS highly useful. It could be administered at the beginning of
the year or semester. Since the instrument is economical in terms of time
required to administer and score it, teachers could quickly identify those
students who may be potential poor readers.

The utilization of the cognitive self-rating scale helps illustrate the
significance of student input into the diagnostic process. The results of this
study suggest that supplemental forms of cognitive assessment can be
employed when attempting to measure reading performance. The diagnostic
process at the secondary level needs to become less myopic and more
comprehensive because many times critical instructional decisions are made
on the basis of one instrument. While more research needs to be done with
cognitive self-rating scales, they do have the potential for becoming a very
useful instrument.

The comprehension of prose:
problems in measuring learning outcomes

A major function of our schools is to communicate knowledge.
Undoubtedly, the largest proportion of this data is communicated through
printed verbal material; the student is required to read in order to learn.
Because of its importance, reading comprehension has been widely studied.
The educational psychologist has contributed to this burgeoning literature,
usually in an effort to find means of facilitating reading comprehension as well
as measuring it. Many of these prose learning studies have focused on
manipulating the learner's task instructions in order to assess how these
instructions influence what is learned. These are certainly commendable
efforts. Finding ways to facilitate learning, in this case prose learning, cannot
be faulted. However, a careful review of a large sample of the literature
reveals that most often the learning outcome that is assessed is educationally
trivial. For example, a learner may be asked to recognize or recall a series of
words taken from the instructional material. Measuring this type of learning
outcome (e.g., names, dates, quantities, technical terms) can only guarantee
that rote learning has occurred. Recognizing the surface structure of a
linguistic input does not insure that understanding has occurred. Education
and, therefore, research should be more interested in meaning than in
memory for the verbatim input.

This is not a new cry, of course. Others have made the same plea to
measure comprehension rather than memorization (Allen, 1970; Andre & Sola,
1976; Cunningham, 1972; Duell, 1974; Felker & Dapra, 1975; Hiller, 1974;
Shavelson, Berliner, Ravitch, & Loeding, 1974; Watts & Anderson, 1971). This
paper is an attempt to review research which demonstrates the distinction
between rote and meaningful learning of written text and the implications of
that distinction for the measurement of learning outcomes. In most reading, the
retention of discrete and specific information is less important than the
meaningful comprehension of basic propositions, the making of required or
invited inferences, or the integration of ideas into a meaningful, unified whole.
Research has indicated that a learner's memory is quite often an abstract
construction created from the text itself rather than a reproduction of it. Since
most classroom learning is designed to become incorporated into seemingly
stable long-term memory, the distinction between reproductive and
constructive processes has implications for the measurement and evaluation
procedures of students' learning through reading. A sample of current
research findings will, accordingly, be analyzed in terms of the assessment
techniques utilized and the type of dependent variable under scrutiny. A
taxonomy of these variables will be presented based upon the distinction
between reproductive and constructive processes mentioned above. The
validity of assessment devices will be discussed and implications for research
design will be made.

Meaningful Learning

In meaningful learning the effort is to acquire understanding. Understanding
or comprehension is the result of relating new material to what one already
knows (Ausubel, 1968; Haviland & Clark, 1974; Kintsch, 1974; Smith, 1975). If
the learner is successful in relating new material to cognitive structure, he has
acquired meaning. Meanings are more abstract than the words used to
convey them. They refer to the substance or "gist" of the message and are
generally not word specific. Meaning is a function of those concepts that are
used to express it (Kintsch, 1974). A concept can usually be expressed by
several synonymous words which still communicate the same meaning. In
acquiring meaning, then, readers can store meaning in the form which is most
readily incorporated into their particular cognitive structure. They can use
those concepts and interrelationships among concepts which are most
"meaningful" to them.

If learners are asked to store the actual linguistic input through which the
meaning came (a verbatim memorization task), their job is more difficult.
Verbatim learning is seemingly less natural than meaningful learning (e.g.,
Bransford & Franks, 1971), it requires more time (Jones & English, 1926), it
does not insure meaningful learning (Burger & Perfetti, 1977; Jones & English,
1926), and it does not persist as well over time (Bobrow, 1970; Sachs, 1967).
Educationally, it has been argued by Gagne (Gagne, 1976; Gagne & Briggs,
1974) that it is generalizable intellectual skills which make the learner
competent, not his store of discrete, factual, verbal information. These
intellectual skills are understandings or meanings about concepts and rules,
for example, which are usable by the learner in a variety of situations. Factual
knowledge is certainly educationally important, but acquiring a system of
understandings (i.e., meaningful learning) has been argued by many to be a
learning outcome of higher priority.

Memory for Text and Learning Outcomes 

Memory for a reading event is constructive (e.g., Bransford, Barclay, &
Franks, 1972; Cofer, 1973; Neisser, 1967). That is, readers don't interpret the
semantic content of a message and store it as an isolated, independent event.
Rather, the message content acts as a cue which readers can use to construct
a semantic schema (i.e., representation) of the information. These constructed
descriptions often contain more information than is represented by the
linguistic input (inferences, for example) and, therefore, a purely linguistic
analysis will not characterize all of the information available to the reader.
Prose memory goes beyond and enriches the semantic content presented.
Supporting evidence comes from several sources. There is evidence for the
recall of inferred meaning which fills gaps in the semantic continuity of the text
(Kintsch, Kozminsky, Streby, McKoon, & Keenan, 1975) and for the integration
of the linguistic input into complex, abstracted knowledge structures
(Bransford & Franks, 1971; Tzeng, 1975). Individuals also construct memory
structures through editing, summarizing, generalizing, etc., in order to
minimize information overload (Dawes, 1966; Frederiksen, 1975a; Gomulicki,
1956) and infer and store intended rather than literal meaning when required
(Clark & Lucy, 1975; Stillings, 1975).

The ability to reproduce, either by recognition or recall, discrete, factual
data corresponds to the lowest level of Bloom's (1956) taxonomy of cognitive
objectives. If other, higher level outcomes are available and if memory is
constructive such that it builds upon elaborated meanings which go beyond
the expressed text, then the measurement of reading outcomes should
express these phenomena. If it does not, this type of measurement, at best,
ignores much of a reader's memory schema for what has been read and
emphasizes trivial outcomes. Knowing the outcomes expected may force the
reader to encode specifics at the expense of broader understandings. It may
even misrepresent what the reader has acquired. Learning and memory
expectancies are known to affect a reader's knowledge of a text (Gauld &
Stephenson, 1967) and may even hide some of his knowledge of what has
been read (Brockway, Chmielewski, & Cofer, 1974).

The point being made is that the effects of a reading experience go beyond
what is measured by accurate recall and recognition of factual data. The
memory schema for a reading event contains much more than a replica of the
stimulus material. Educationally relevant prose learning measurement,
therefore, ought to reflect outcomes other than those at the lowest taxonomic
level (Bloom, 1956).

A Taxonomy of Reading Outcomes 

Since a reader's memory schema is a richly elaborated storehouse of
information, it contains more than the facts that were expressed on the written
page. Most prose learning studies, unfortunately, use factual verbatim
information as the dependent variable of interest. It will be shown that other
outcomes that are educationally important and reflect the constructed,
elaborated and even productive nature of reading memory have been used
only too rarely. Certainly, though, these other "higher level" outcomes are
evident in the literature. Our review of the kinds of dependent variables
examined in current research along with current understanding of memory for
text leads to a proposed logical system or taxonomy of these reading
outcomes.

This taxonomy categorizes four types of outcomes or classes of dependent
variables: memorization, translation, inference, and application. These
categories describe behaviors readers may be expected to exhibit after a
reading event. They seem to represent dimensions of memory schemata which
reflect the characteristics of information from prose which can be acquired,
retained, and retrieved by a reader.

Memorization refers to the verbatim remembering of the text or portions of
it. Translation is recognizing or recalling information (i.e., meaning) in a form
different from the manner in which it was expressed in the text. An inference is
information contained in a memory schema which goes beyond and,
therefore, elaborates the expressed text. Application is the application of the
information in the schema to new events; it requires the reader to transform
the information in order to respond to new demands in the environment.

The ordering of the taxonomic levels implies two different hierarchies. One
hierarchy is grounded in a series of progressively more advanced levels of
educational outcomes or goals. For example, memorization refers to the
retention of information, translation and inference to its comprehension, and
application to its transferability. The other hierarchy suggests a progression of
prerequisite skills. For example, the application of information requires its
comprehension as a prerequisite. In the following sections, the levels of the
taxonomy will be described in greater detail and exemplified by citations to
current research studies.



Level 1: Memorization

Memorization emphasizes the retention and retrieval of specifically
expressed information from the text. The name given to this level is not
completely satisfactory because of its association in the general population
with the intentional act of remembering a text verbatim through repeated drill.
The reader is expected to retrieve verbatim data in the form in which it was
expressed in the text. If the linguistic form of the information changes (e.g.,
"10" becomes "ten" or "She generated various ideas" becomes "Many
ideas were produced by her"), it has undergone translation and, therefore,
belongs to another category of the taxonomy.

Retrieval of this type of information can be through either recognition or
recall. Boker (1974) has assessed recognition of specific facts and Mayer
(1975) has likewise assessed larger units (e.g., definitions). Recall has been
used to measure retention of specific facts with fill-in questions (Frase &
Schwartz, 1975), free recall of facts (Shimmerlik & Nolan, 1976), prompted
recall of propositions (Ross & DiVesta, 1976), and free recall of entire passages
(Jones & English, 1926). Typically, studies have measured the retention of
facts such as names, dates, technical terms and quantities (e.g., Frase, 1975;
Kaplan, 1974; Rothkopf & Bisbicos, 1967). Specific, factual information is, of
course, important and does serve a valuable role in education. It should not be
the outcome emphasized, however (Ausubel, 1968).

Level 2: Translation

Translation is synonymous with paraphrase. Acquisition and retrieval of
paraphrases of text is considered by some to be required for understanding
(Ausubel, 1968). Acquisition of a paraphrase can be best described as
"meaning" that has been acquired which can take any number of linguistic
forms (Kintsch, 1974). Translation or paraphrase questions are considered by
Anderson (1972) and Bloom (1956) to be techniques for measuring
understanding. Translation is changing a communication into parallel forms.
Paraphrasing a sentence or summarizing a paragraph in one's own words are
examples of creating parallel forms. However, they do not even need to be
similar in content form. A sentence (semantic) can be represented by a picture
(figural), a set of directions by a map, etc. The major advantage of measuring
translation outcomes is their sensitivity to comprehension rather than
memorization (Bloom, 1956). The emphasis is on meaningful verbal learning
rather than rote learning (Ausubel, 1968). These outcomes are usually
assessed by having the reader use his own words to attempt to express the
"gist" of the text (e.g., Johnson, 1974; Meyer & McConkie, 1973).

Level 3: Inferences

As stated earlier, inferences make a significant contribution to a reader's
memory schema for a passage. This is known to be a developmental
phenomenon which occurs early but becomes indistinguishable from adults'
inferential behavior at around eleven or twelve years of age (Paris & Lindauer,
1976). Many types of inferences are possible as well as necessary.
Interpolation and extrapolation of information is one type (cf. Bloom, 1956).
Inferring an author's point of view, assumptions, mood, etc., are other
examples. These kinds of reading outcomes are, of course, measured in some
reading tests (cf. Davis & Davis, 1962).

Requiring readers to make inferences has been shown to influence those
readers' retention of related material. In Rickards (1976), asking readers to
infer that southern Mala is a desert increased their retention of the ideas in the
paragraph over those who were not asked to make the inference. The results
would be expected to be the same for readers who spontaneously made the
desert inference. There is data using other material which has supported
this prediction (Christie & Schumacher, 1975). These results demonstrate that
it is more than theoretically interesting to discover that readers spontaneously
generate and encode inferences during comprehension (Frederiksen, 1975a).
Asking readers to go beyond the information given through adjunct questions
(Rothkopf, 1966) or test expectation (Brockway, Chmielewski, & Cofer, 1974)
can facilitate text comprehension.

Level 4: Application

Application accounts for the "productive" nature of reading memory. That
is, rather than simply being retrieved, information can be retrieved and
transformed to produce new outcomes; information can be used in new ways
or applied in novel situations. This outcome assesses the reader's ability to
solve new problems or explain phenomena using his new knowledge (Sanders,
1966). This can range from using new concepts under novel circumstances
(Watts & Anderson, 1971) to solving new mathematics problems (Mayer,
1975).

The Classification of Current Research 

Table 1 provides an example of how some current research can be 
classified using the proposed taxonomy. Most of the studies were published 
during the period 1974-1976 in the Journal of Educational Psychology and are 
representative of current trends in prose learning measurement. Some earlier 
research has been included because of frequent citation in the more recent 
papers. Often it was difficult to discover the nature of the reading outcome(s)
measured in a research paper (e.g., Walker, 1974). When this was the case, a
best guess was used. Otherwise, when a study assessed more than one type
of outcome, it was classified according to the "most advanced" level used
(e.g., Felker & Dapra, 1975).
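
The classification rule just described (assign a study to the "most advanced" level it measured) can be sketched with an ordered enumeration. This is an illustrative sketch only; the names `Outcome` and `classify_study` are hypothetical and do not come from the paper.

```python
from enum import IntEnum


class Outcome(IntEnum):
    """The four levels of the proposed taxonomy, ordered lowest to highest."""
    MEMORIZATION = 1
    TRANSLATION = 2
    INFERENCE = 3
    APPLICATION = 4


def classify_study(outcomes_measured):
    """Return the most advanced level among the outcomes a study measured."""
    if not outcomes_measured:
        raise ValueError("a study must measure at least one outcome")
    return max(outcomes_measured)
```

For example, a study that measured both verbatim recall and the solving of new problems would be classified as `Outcome.APPLICATION`, since that is the highest level present.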

Implications for Research Design

Table 1 graphically demonstrates the disproportionate number of research
studies sampled here that measured what might be called lower level
outcomes. The higher levels are significant dependent variables which should
receive more attention in the literature. Accordingly, future research should
evaluate the effectiveness of its independent variables in terms of the more
educationally meaningful and theoretically more valuable outcomes.

The taxonomy also should provide a means whereby research can be
compared. This classification system would allow scholars to compare others'
research in terms of their own ongoing work. In this way the dissemination of
results and the expansion of our knowledge might be facilitated. Probably the
single most important contribution that this taxonomy makes is its highlighting
of inferences as significant reading outcomes for prose learning research.
There is much current work on inferential phenomena (Frederiksen, 1975b;
Kintsch, 1977). Future research which assesses them should carefully
determine the nature of the inferences sought and just as carefully develop an
instrument to measure them. This has not always been the case (McConkie,
Rayner, & Wilson, 1973). Studies should also explicitly report the nature of all
dependent variables and instruments used to measure them.



Table 1

Research citations classified according to
the four levels of the proposed taxonomy

LEVEL 1: MEMORIZATION

Arkes, Schumacher, & Gardner, 1976
Bassin & Martin, 1976
Boker, 1974
Christie & Just, 1976
Duchastel & Brown, 1974
Frase, 1968
Frase & Kreitzberg, 1975
Frase & Schwartz, 1975
Gagne & Rothkopf, 1975
Kaplan, 1974
Kaplan, 1976a
Kaplan, 1976b
Kaplan & Rothkopf, 1974
Kaplan & Simmons, 1974
Koran & Koran, 1975
LaPorte & Nath, 1976
LaPorte & Voss, 1975
Rasco, Tennyson, & Boutwell, 1975
Rothkopf, 1966
Rothkopf & Billington, 1974
Rothkopf & Billington, 1975
Rothkopf & Billington, 1975
Rothkopf & Bisbicos, 1967
Ross & DiVesta, 1976
Shimmerlik & Nolan, 1976
Siegel, Lautman, & Burkett, 1974
Snowman & Cunningham, 1975
Voss, 1974
Walker, 1974
Yekovich & Kulhavy, 1976

LEVEL 2: TRANSLATION

Allen, 1970
Andre & Sola, 1976
Bower, 1974
Crouse, 1974
Hiller, 1974
Johnson, 1974
Jones & English, 1926
King & Russell, 1966
Meyer & McConkie, 1974
Rickards & August, 1975
Royer & Cable, 1975
Royer & Cable, 1976
Shavelson, Berliner, Ravitch, & Loeding, 1974
White & Gagne, 1976

LEVEL 3: INFERENCES

McConkie, Rayner, & Wilson, 1973
Rickards, 1975-6
Rickards, 1976
Rickards & DiVesta, 1974

LEVEL 4: APPLICATION

Duell, 1974
Felker & Dapra, 1975
Mayer, 1975
Watts & Anderson, 1971



A reader does more, much more, than passively store independent
linguistic information. The actual linguistic data is only a vehicle by which
meaning is conveyed, meaning which includes information that was not
expressly communicated. The reader is also a user of his stored meanings; he
applies them in new ways. As a result, research should be assessing these
outcomes and classifying results in a fashion which facilitates their
communicability.

Reliability of the cloze procedure as
assessments of various language elements



As a measure of reading comprehension, the cloze procedure ". . . is
possibly the most thoroughly validated and sophisticated method of testing
presently used in education" (Bormuth, 1975, p. 82). However, investigations
of cloze have frequently provided ". . . only correlational data between cloze
test scores and comprehension test scores . . . in the absence of direct true
score reliability coefficients . . ." (Alameida, 1973, pp. 28-29). The purposes
of this study were (1) to examine the reliability of the cloze procedure as a
measure of semantic awareness in post-reading situations and (2) to examine
the reliability of cloze scores as an index of students' syntactic fluency.

Method 

The subjects for this investigation were 298 eighth grade students in
Tucson, Arizona. Each student read two science-related selections (Passage A
about Cancer and Passage B about Superstitions) determined to be of eighth
level reading difficulty as predicted by the Fry Readability Graph (1967). The
study was conducted on two consecutive days with the students randomly
assigned to two groups. On the first day, one group read Passage A and
responded to the cloze test for that passage; the other group read Passage B
and responded to the cloze test for that passage. On the second day, the
passages were reversed for the groups.

The cloze tests were based on a fifty-item, every-seventh deletion pattern.
The procedures for the students were (1) read the passage, (2) turn in the
passage and get the cloze test, and (3) respond to the cloze test. No time limit
was placed on either the reading or the cloze test. So that scores of syntactic
fluency could be derived as well as scores of semantic awareness, students
were directed to "Fill in every blank. Leave no blank empty, even if you have to
take a wild guess."
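
A fifty-item, every-seventh deletion pattern can be sketched as below. This is a simplified illustration, not the authors' test construction: it assumes whitespace tokenization, starts deleting at the seventh word, and leaves any punctuation attached to a deleted word inside the answer key. The function name `make_cloze` and its parameters are hypothetical.

```python
def make_cloze(text, every=7, n_items=50, blank="_____"):
    """Replace every `every`-th word with a blank, up to n_items blanks.

    Returns the cloze passage and the list of deleted words (the answer key).
    """
    answers = []
    output = []
    for position, word in enumerate(text.split(), start=1):
        if position % every == 0 and len(answers) < n_items:
            answers.append(word)   # keep the deleted word for scoring
            output.append(blank)
        else:
            output.append(word)
    return " ".join(output), answers
```

A passage needs at least 350 words for this pattern to yield all fifty blanks; shorter passages simply produce fewer items.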

Each blank on the cloze tests was examined for exact-replacement,
acceptable synonyms, and syntactic acceptability. Synonyms were
determined acceptable when agreed to by at least three of four evaluators. In
the syntactic acceptability category, a response was considered correct if it
agreed with the part of speech and the proper use in context of the deleted
item.

Based on these categories, the following five scores were derived: (1)
Semantically Acceptable, Exact-Replacement (ER); (2) Semantically
Acceptable, Exact-Replacement-Plus-Synonyms (ER + SNM); (3)
Syntactically Acceptable (STC); (4) Syntactically Acceptable Minus Exact-
Replacement (STC - ER); and (5) Syntactically Acceptable Minus Exact-
Replacement-Plus-Synonyms (STC - (ER + SNM)). Each of these scores was
expressed as a percentage, using the following formulas:

\[
\begin{aligned}
\text{ER score} &= \frac{\text{ER}}{50} \\
\text{ER + SNM score} &= \frac{\text{ER} + \text{SNM}}{50} \\
\text{STC score} &= \frac{\text{STC}}{50} \\
\text{STC - ER score} &= \frac{\text{STC} - \text{ER}}{50 - \text{ER}} \\
\text{STC - (ER + SNM) score} &= \frac{\text{STC} - (\text{ER} + \text{SNM})}{50 - (\text{ER} + \text{SNM})}
\end{aligned}
\]
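
The five formulas can be applied to one student's response counts as in the sketch below. It rests on assumptions not stated in the source: that the STC count includes the exact replacements and accepted synonyms, that each ratio is multiplied by 100 to give a percentage, and that a score with a zero denominator is reported as undefined. The function name `cloze_scores` and the dictionary keys are hypothetical.

```python
def cloze_scores(er, snm, stc, n_items=50):
    """Compute the five cloze scores as percentages for one student.

    er:  number of exact replacements
    snm: number of acceptable synonyms (not counting exact replacements)
    stc: number of syntactically acceptable responses (assumed to include
         the er and snm responses)
    """
    semantic = er + snm

    def percent(numerator, denominator):
        # A student with no remaining blanks has no defined extracted score.
        return 100.0 * numerator / denominator if denominator else None

    return {
        "ER": percent(er, n_items),
        "ER+SNM": percent(semantic, n_items),
        "STC": percent(stc, n_items),
        "STC-ER": percent(stc - er, n_items - er),
        "STC-(ER+SNM)": percent(stc - semantic, n_items - semantic),
    }
```

Note that the last two scores use a denominator that differs from student to student, so they are based on fewer than fifty responses whenever a student has any semantically acceptable answers.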


All of these scores were examined for internal consistency reliability using 
Cronbach's Alpha. Parallel form reliability was determined with Pearson 
product-moment correlations. While specific results will be mentioned in the 
text, Table 1 provides a synoptic view of all the coefficients. 
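
Cronbach's Alpha, used here for internal consistency, can be computed from item-level scores as sketched below. This is a generic illustration of the standard formula rather than the authors' computation; the function name `cronbach_alpha` and the input layout (one row of item scores per student) are assumptions.

```python
from statistics import pvariance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a students-by-items table of scores.

    item_scores: list of rows, one per student, each row holding that
    student's score on every item (for cloze, 1 = acceptable, 0 = not).
    """
    k = len(item_scores[0])
    columns = list(zip(*item_scores))
    sum_item_variances = sum(pvariance(column) for column in columns)
    total_variance = pvariance([sum(row) for row in item_scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    # alpha = (k / (k - 1)) * (1 - sum of item variances / total variance)
    return (k / (k - 1)) * (1 - sum_item_variances / total_variance)
```

The ratio of variances is the same whether population or sample variances are used, provided the same convention is applied to the items and to the totals.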

Results

Coefficients for the two semantic scores (exact-replacement and exact-
replacement-plus-synonyms) ranged from .86 to .92 for internal consistency
and .79 to .81 for parallel form reliability. The difference between internal
consistency coefficients and parallel form coefficients was significant
(p < .01) as determined by Feldt's W test (1969) and Fisher's Z test (Glass and
Stanley, 1970).

The internal consistency coefficients of the syntactically acceptable score
were .95 and .94 for the two selections. The parallel form coefficient was .86.
Internal consistency coefficients differed significantly (p < .01) from the
parallel form coefficient. All coefficients for the syntactically acceptable score
were significantly greater (p < .01) than the coefficients for any of the other
types of scores.

For the two syntactic scores derived by extracting part or all of the semantic
scores, the internal consistency coefficients ranged from .67 to .81. For each
of these two scores, the coefficients for Passage A were significantly greater
(p < .01) than the coefficients for Passage B. The parallel form coefficients for
these scores were .73 and .67, and these coefficients were significantly lower
(p < .01) than their respective internal consistency coefficients.






Discussion 

For all practical purposes, the reliability of these five types of cloze scores
is clearly established. However, four statistically significant findings deserve
examination. They are:

1. For all the scores, the internal consistency coefficients are
significantly higher (p < .01) than the parallel form coefficients;

2. The internal consistency coefficients of the two semantic scores are
significantly higher (p < .01) than the internal consistency of the
corresponding syntactic scores;

3. The reliability coefficients of the total syntactically acceptable score
are significantly greater (p < .01) than the coefficients of each of the
other scores; and

4. When corrected for attenuation, the parallel form coefficients of the
syntactic scores are significantly higher (p < .01) than the parallel
form coefficients of the corresponding semantic scores.

Table 1
Reliability Coefficients of
Various Cloze Scores

Internal Consistency Coefficients

                                               Passage A    Passage B
                                               (Cancer)     (Superstitions)

Semantic, Exact-Replacement Only                 .86
Semantic, Exact-Replacement-Plus-Synonyms        .90          .92
Syntactic, Total                                 .95          .94
Syntactic, Exact-Replacement Extracted           .81          .72
Syntactic, Exact-Replacement-Plus-Synonyms
  Extracted                                      .77          .67

Parallel Form Coefficients

                                               r            corrected r

Semantic, Exact-Replacement Only                 .79          .91
Semantic, Exact-Replacement-Plus-Synonyms        .81          .89
Syntactic, Total                                 .86          .92
Syntactic, Exact-Replacement Extracted           .73          .96
Syntactic, Exact-Replacement-Plus-Synonyms
  Extracted                                      .67          .93
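
The correction for attenuation can be illustrated with the standard formula, which divides an observed correlation by the square root of the product of the two reliabilities. The source does not state which formula was used, so this is an assumption; taking the two internal consistency coefficients from Table 1 as the reliabilities does, however, reproduce several of the table's "corrected r" entries. The function name `correct_for_attenuation` is hypothetical.

```python
from math import sqrt


def correct_for_attenuation(r_xy, reliability_x, reliability_y):
    """Estimate the correlation between true scores on two measures.

    r_xy: observed correlation between the two forms
    reliability_x, reliability_y: reliability of each form (assumed here
    to be the internal consistency coefficient of each passage)
    """
    if reliability_x <= 0 or reliability_y <= 0:
        raise ValueError("reliabilities must be positive")
    return r_xy / sqrt(reliability_x * reliability_y)
```

For the exact-replacement-plus-synonyms score, `correct_for_attenuation(.81, .90, .92)` rounds to .89, and for the two extracted syntactic scores the inputs (.73, .81, .72) and (.67, .77, .67) round to .96 and .93, matching Table 1. Because the extracted syntactic scores have the lowest internal consistency, the correction raises their parallel form coefficients the most.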

Passage Dependency

The significant differences between the internal consistency coefficients
and the parallel form coefficients suggest that all of these cloze scores may
be somewhat passage dependent. For most practical purposes, given the
relatively high parallel form coefficients (.67 to .86), the degree to which
reliability is passage dependent is likely to be of minimal concern. However,
this discrepancy may be of particular interest to a researcher who is
concerned with (1) whether this tendency toward passage dependency is
representative of the construct underlying cloze as a measure of various
language factors and (2) if it is representative, why this might be so. These are
issues of validity and cannot be answered by these findings. What these data
do suggest is that all of these scores are dependent upon the passage used
and a caution must be raised about the generalizability of cloze scores. This is
a reasonable caution for the use of any score of reading comprehension since
comprehension (however defined) may be specific to a given passage. To
generalize students' ability on the basis of performance on a single test of any
nature is hazardous at best, and to do so on the basis of a cloze test is no
exception.

Differences in Internal Consistency

The significant differences between the internal consistency coefficients of
the semantic and corresponding syntactic scores suggest that the semantic
scores are more reliable. Such a conclusion, however, is probably not valid
because of the way those syntactic scores are derived, that is, by extracting
part or all of the semantic scores. In the process of extracting the semantic
scores, the variability within the resulting syntactic scores was substantially
restricted.

The implications of this restricted variability become clearer with a
recognition that these syntactic scores are based on an unequal number of
responses, since the extraction factor for each individual varies considerably
depending upon the number of responses that were semantically acceptable.
Thus, the number of responses considered in these syntactic scores is lower
than the consistent number (50) of scored semantic responses. This tends to
reduce the internal consistency coefficients of these syntactic scores
because the potential variability is both inconsistent and less than that
potential for the semantic scores.

In effect, then, this seems to be a statistical finding due more to a scoring
procedure than a finding with practical implications. Given this computational
limitation, it is likely that the internal consistency coefficients for these
syntactic scores are deflated, and if some reasonable means could be
identified to correct for this limitation, these coefficients would be
substantially higher. Unfortunately, there is apparently no way to correct for
this problem without significantly increasing the length of the cloze test, and
that seems to be unnecessary in light of the relatively high coefficients.

This limitation of restricted variability may become even clearer when
examined exclusively within the syntactic scores derived by extracting part or
all of the semantic factor. Among the internal consistency coefficients of
these two scores, a significant difference exists between Passage A and
Passage B. This is not considered an important finding; yet it is worthy of
discussion as clarification and as an example of another issue related to cloze
scores.

For Passage A the coefficients of internal consistency for these two scores
were .81 and .77, but for Passage B they were .72 and .67. Those for Passage
A were significantly higher (p < .01) than those for Passage B as determined
by Feldt's W test. One might be tempted to surmise that these results support
the premise that reliability of various cloze scores is passage dependent.
However, this difference becomes clearer when examined in relation to the
issue of restricted variability.



The percentage means of the two semantic scores for these passages were,
respectively, J9 and 42 for the exact replacement and 35 and 58 for the exact
replacement plus synonyms. Clearly, Passage B was easier for these students
than Passage A, irrespective of the determination by Fry's graph that these
two selections were of eighth level difficulty. Because the students obtained
higher semantic scores on Passage B, when those scores were subtracted to
determine the corresponding syntactic scores for Passage B, the remainder
was significantly less than the remainder for Passage A. The result was that
the variability within these two syntactic scores for Passage B was restricted.
This in turn deflated the internal consistency coefficients for Passage B. Had
this restricted variability not existed, it is likely that these coefficients for the
two passages would have been as akin as the corresponding coefficients for
the semantic scores. Thus, it would be inappropriate to support the premise of
passage dependency with the reliability of these scores.

Syntactically Acceptable Score 

An intriguing prospect is offered by the syntactically acceptable score in
that it may be the most accurate index of students' comprehension as
measured by cloze scores because it does include both syntactic and
semantic awareness. What is meant by reading comprehension is an issue to
be debated; however, one perspective, typically called "psycholinguistic,"
suggests that it should include semantic understanding and syntactic fluency.
The cloze test, in general, requires that a student utilize understandings about
the structure of language and the content of a specific selection to replace
items that have been deleted from the selection. Given the exceptionally high
reliability coefficients for the total syntactic score, and given that those
coefficients are significantly higher than all the other scores, it is not an
unreasonable hypothesis that the total syntactically acceptable score is a
more accurate index of reading ability than other scores as determined by the
cloze procedure. It would prove to be a cumbersome procedure to be used by
classroom teachers if they were to analyze responses for syntactic
acceptability, but that is not too much to ask of researchers if this score were
found to be a more accurate index of reading comprehension than the other
scores. To ignore syntactically acceptable responses on a cloze test seems
tantamount to discarding pertinent information about the students'
understanding of the selection, especially if certain psycholinguistic
assumptions are accepted. Since implications cannot be extended beyond the
existing data, the most that can be suggested is a need for a construct
validation of the total syntactic score, along with the others discussed here, to
shed more light on what the various scores are revealing about students'
reading comprehension.

Parallel Form Coefficients

The significant difference between the parallel form coefficients of the
semantic and corresponding syntactic scores may also have implications for
the construct validity of these various cloze scores. Initial observation of these
coefficients indicates that the coefficients for the semantic scores are
significantly greater than those for the syntactic scores. This can best be
explained by the deflated internal consistency coefficients caused by
restricted variability in the syntactic scores. That the internal consistency
coefficients are deflated has a concomitant effect on the parallel form
coefficients. Thus, to examine the relationship of the parallel form coefficients
of the two scores appropriately, it was necessary to correct the parallel form
coefficients for attenuation (Nunnally, 1967), given that the internal
consistency coefficients for the two selections were not perfect.
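
The correction for attenuation mentioned above can be illustrated with a short worked example. This is a minimal sketch, assuming the standard form of the correction (the observed parallel form coefficient divided by the square root of the product of the two internal consistency coefficients); the function name is hypothetical, and the input values are those reported in Table 1.

```python
import math


def correct_for_attenuation(r_ab, rel_a, rel_b):
    """Disattenuate an observed parallel form correlation.

    r_ab  : observed correlation between scores on Passage A and Passage B
    rel_a : internal consistency coefficient for Passage A
    rel_b : internal consistency coefficient for Passage B
    (Hypothetical helper; assumes the standard correction formula.)
    """
    if not (0 < rel_a <= 1 and 0 < rel_b <= 1):
        raise ValueError("reliability coefficients must lie in (0, 1]")
    return r_ab / math.sqrt(rel_a * rel_b)


# Rows of Table 1 whose coefficients are all legible.
semantic_plus_synonyms = correct_for_attenuation(0.81, 0.90, 0.92)
syntactic_exact_extracted = correct_for_attenuation(0.73, 0.81, 0.72)
syntactic_synonyms_extracted = correct_for_attenuation(0.67, 0.77, 0.67)

for label, value in [
    ("Semantic, exact-replacement-plus-synonyms", semantic_plus_synonyms),
    ("Syntactic, exact-replacement extracted", syntactic_exact_extracted),
    ("Syntactic, plus-synonyms extracted", syntactic_synonyms_extracted),
]:
    print(f"{label}: {value:.2f}")
```

With the Table 1 values this yields approximately .89, .96, and .93, which agrees with the corrected r column for those three scores.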

When corrected for attenuation, the parallel form coefficients indicate that
the syntactic scores are significantly (p < .01) more reliable than the semantic
scores. An explanation for this is that the syntactic scores derived by
extracting the semantic variable do indeed assess a syntactic factor. If they
do, it is reasonable that the syntactic score would be more reliable across
passages (parallel forms) because the structure of the language remains
relatively constant while the content of the selections (semantic factor) tends
to vary.

Conclusion 

All of the cloze scores examined in this study are highly reliable for practical
purposes. The significant differences between internal consistency reliability
and parallel form reliability lead to one primary implication: caution must be
exercised when using cloze scores based on a single passage to draw
conclusions about students' reading ability. It follows, therefore, that multiple
cloze selections should be used for accurate assessments of reading ability.

Stability of cloze scores across
varying deletion patterns

Developers of cloze instruments have proposed a variety of procedures for 
selecting words to be deleted from a passage. Taylor, in his introduction of 
the cloze procedure (1953), depended on the principles of statistical random 
sampling for selection of words to be deleted. Random deletion is accomplished 
by using a table of random numbers to delete words up to a predetermined 
percentage of the words in the passage, e.g., a random 20 percent. More
recently various modified cloze procedures have been proposed; however, 
the predominant number of studies use some approximation of random deletion. 
Random deletion has typically been approximated by deletion of every nth word 
with the value of n specified by the cloze developer.

Every nth word deletion patterns have been employed frequently in research
studies where true random deletion patterns have been virtually ignored. Two 
primary reasons may be presented for use of nth word deletion patterns 
rather than random deletion patterns. 

1. Developers consider the nth word deletion patterns easier to construct
than random deletion patterns.

2. Researchers (Fillenbaum, Jones, & Rapport, 1963; MacGinitie, 1961) have
concluded that four or more words are needed between blanks in order to
obtain the most reliable and presumably valid measures. A conclusion based
on these findings has been that items attain their maximum independence from
one another when a minimum of four words intervene between each item
(Bormuth, 1975). In true random deletion the number of intervening words
between items may vary from zero to a large number of words.

Given these advantages one might still question whether a form of systematic
sampling such as nth word deletion should be used in measurement of a
construct which is potentially cyclic in nature and could result in biased samples.
Researchers and practitioners alike are interested in a stable as well as reliable
measure of a subject's ability to read a passage. That is, different forms
constructed on the same passage should not only rank the subjects the same
(reliability) but also provide the same index of a subject's ability on that passage
(stability). The stability of nth word deletion has already been questioned in a
study by Bormuth (1964) where he varied the starting point of an every-fifth
word deletion pattern to produce all five possible forms for each of 20 passages.
Significant differences were found among the means of the various forms.
Bormuth concluded that more than one cloze test should be constructed for a
given passage when using nth word deletion patterns. If one follows Bormuth's
recommendation of multiple forms, the first advantage of nth word deletions, i.e.,
ease of construction, is eliminated. One rarely finds multiple forms being used in
the literature, possibly due to their impracticality. Given that one form and one
form only is usually developed over a passage, investigations of the stability of
the resulting index are needed comparing nth word deletion patterns to random
word deletion patterns.

The second advantage of nth word deletion patterns, maximum reliability,
has been built on evidence from research on nth word deletion patterns
only. When patterns other than nth word deletion patterns were constructed
for comparisons, these patterns were usually of the selective or rational deletion
variety. Little, if any, evidence exists which compares the reliability of cloze tests
based on nth word deletions to the reliability of cloze tests based on random word
deletions. If the conclusion concerning maximum item independence is correct,
then one would expect nth word deletion patterns to result in more reliable
measures than random word deletion patterns.

The purpose of this study was to investigate the stability and reliability of
scores resulting from cloze tests based on nth word deletion patterns as
compared to cloze tests based on random word deletion patterns. Stability
was assessed by comparing the variability of means resulting from the different
forms which could be developed using a specific deletion pattern. Reliability
was assessed using Cronbach's Alpha, an index of internal consistency.
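
As an illustration of the reliability index named above, the following is a minimal sketch of Cronbach's Alpha computed from item-level scores. It assumes each cloze item is scored 1 for an exact replacement and 0 otherwise; the function name and the toy data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's Alpha for a table of item scores.

    scores: one row per student, one column per item
            (assumed here to be 1 = exact replacement, 0 = otherwise).
    Uses sample variances throughout; the n - 1 terms cancel in the ratio.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two students")
    item_variances = [
        variance([row[i] for row in scores]) for i in range(n_items)
    ]
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have no variance")
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses of five students to a four-item test.
toy_scores = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
print(f"alpha = {cronbach_alpha(toy_scores):.2f}")
```

Because Alpha depends on the ratio of summed item variances to total score variance, a test on which nearly every student scores alike (little total variance) tends to produce a lower coefficient.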



A reading passage was chosen which was estimated to be at the sixth grade
reading level as determined by the Fry Readability Graph (Fry, 1968). An
every-fifth word deletion rate was used for the nth word deletion pattern as
this rate appears to be the most widely used and accepted deletion rate. All
five possible cloze test forms using an every-fifth word deletion pattern were
developed over a 250 word segment from the selected passage. Five additional
cloze test forms were developed by deleting a random sample of 50 words
from the same 250 word segment. The words deleted for each of the random
deletion patterns were determined by drawing a sample of 50 numbers from
1 to 250. Words in the segment were deleted which corresponded to these
50 numbers. A minimum of 25 words preceded the first blank and a minimum of
25 words followed the last blank of the segment.
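
The two construction procedures described above can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, word positions are 0-based indices within the 250 word segment, and the surrounding 25 words of intact text on either side are assumed to lie outside the segment and are not modeled.

```python
import random


def nth_word_forms(segment_length=250, n=5):
    """All n possible every-nth-word deletion forms.

    Each form is a list of 0-based word positions to delete; form k
    starts at position k and deletes every nth word after it.
    """
    return [list(range(start, segment_length, n)) for start in range(n)]


def random_deletion_form(segment_length=250, n_deletions=50, rng=None):
    """One random deletion form: a sorted sample of positions to delete."""
    rng = rng or random.Random()
    return sorted(rng.sample(range(segment_length), n_deletions))


def min_gap(positions):
    """Smallest number of intact words between two adjacent blanks."""
    return min(b - a - 1 for a, b in zip(positions, positions[1:]))


every_fifth = nth_word_forms()
random_forms = [random_deletion_form(rng=random.Random(seed)) for seed in range(5)]

print("every-fifth gaps:", [min_gap(form) for form in every_fifth])
print("random gaps:     ", [min_gap(form) for form in random_forms])
```

Every-fifth word forms always leave exactly four words between blanks, whereas a random form can place blanks next to one another, which is the item-independence concern raised in the introduction.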

The ten cloze test forms, five every-fifth word deletion test forms and five 
random deletion test forms, were randomly assigned for administration to 446 
seventh and eighth grade students at a junior high school in Tucson, Arizona. 
A sixth grade passage was used as the basis for the tests as standardized
tests indicated the school to be somewhat below national average.

The tests were administered without pre-reading. Students were directed to
fill in each of the blanks with the word that they thought best belonged in the
blank. Students were encouraged to fill in every blank even if they were not
sure of what word should go in the blank. Exact word replacement was used
in scoring the tests.



Table 1 reports the means, standard deviations, and Cronbach's Alpha for 
each of the forms for each deletion pattern. 

A 2 x 5 x 2 (Deletion Pattern x Form x Grade) factorial analysis of variance was
used to analyze the differences among the means. A factorial analysis was
selected to allow investigation of the grade effects. Pattern by form interactions
will have no meaning as there is no basis for particular combinations of forms
across deletion patterns.

The passage, as would be expected, was more difficult for seventh graders
than eighth graders, F(1, 426) = 12.60, p < .01. No interaction existed with grade.
Therefore the effects of pattern and form are generalizable across grade. The
pattern main effect and form main effect are not directly interpretable because
of the significant interaction between them, F(4, 426) = 6.55, p < .01. Tukey
post hoc procedures were used to compare form means with one another within
deletion pattern. Of the ten comparisons possible among forms within the
every-fifth word deletion pattern, five were significant (p < .01). None of the ten
possible comparisons were significant for random deletion forms, even at the

The range of Cronbach Alphas was from .80 to .90 for the every-fifth word
deletion patterns and from .84 to .93 for the random deletion patterns.
The average alpha for fifth-word deletion patterns was .858 while the average for
random deletion patterns was .874. One may certainly conclude that the Alphas
for every-fifth word deletion tests are not significantly greater than Alphas for
random deletion tests.

Table 1
Means, Standard Deviations, and Cronbach Alphas
for Each Form within Each Deletion Pattern

Deletion                              Forms
Pattern                  1        2        3        4        5

Every-Fifth    M       24.8     18.8     dc.1     17/1     1 7 Q
Word           SD       7.7      7.8      5.8      7.8      6.7
               Alpha    .86      .90      .80      .89      .84

Random         M       17.8     19.6     16.3     19.0     15.9
               SD       6.6      7.0      8.2      9.8      6.9
               Alpha    .85      .85      .90      .93      .84




Table 2
Analysis of Variance Summary Table

                                   Sum of        Mean
Source                     df      Squares       Square        F      Probability

Deletion Pattern            1       653.15       653.15      11.73       .001
Form                        4       986.95       246.75       4.43       .002
Grade                       1       701.37       701.37      12.60       .001

Pattern x Form              4      1457.77       364.44       6.55       .001
Pattern x Grade             1        10.62        10.62        .19       .663
Form x Grade                4        59.77        14.94        .27       .898

Pattern x Form x Grade      4       128.74        32.19        .58       .679

Error                     426     23712.29        55.66
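
The F values in Table 2 follow from dividing each effect's mean square by the error mean square. The snippet below is a small sketch of that arithmetic using the tabled mean squares; the variable and function names are hypothetical.

```python
MS_ERROR = 55.66  # error mean square from Table 2 (df = 426)

# Mean squares for each source of variation, as tabled.
mean_squares = {
    "Deletion Pattern": 653.15,
    "Form": 246.75,
    "Grade": 701.37,
    "Pattern x Form": 364.44,
    "Pattern x Grade": 10.62,
    "Form x Grade": 14.94,
    "Pattern x Form x Grade": 32.19,
}


def f_ratio(ms_effect, ms_error=MS_ERROR):
    """F statistic for an effect: effect mean square over error mean square."""
    return ms_effect / ms_error


f_ratios = {source: round(f_ratio(ms), 2) for source, ms in mean_squares.items()}

for source, f in f_ratios.items():
    print(f"{source:<24} F = {f:.2f}")
```

Rounded to two decimals, these ratios reproduce the tabled values (for example, 12.60 for Grade and 6.55 for the Pattern x Form interaction).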










Discussion and Conclusions

As in Bormuth's study, significant differences were found among the means
from the five every-fifth word deletion cloze tests. Yet, no significant differences
were found among means from five random deletion cloze tests. This indicates
that random deletion patterns result in a more stable index representing
students' abilities on a particular passage. Therefore, if only one cloze test is
to be constructed over a passage then a random deletion pattern would be more
desirable than an every-fifth word deletion pattern. This is especially true when
one considers that no sacrifice in reliability is made.

The range of means for the every-fifth word deletion forms was 7.5 points,
or a difference of 15 percent. The range for random deletion means was 3.7
points, or 7.4 percent. This percentage range would make a considerable
difference if one is using criterion cutoffs for determining frustrational,
instructional, or independent levels for a passage. Again random deletion patterns
would be more desirable than every-fifth word deletion patterns. However, new
criteria for cutoffs would need to be developed because the random
deletion tests are more difficult than the every-fifth word deletion tests.

The reliability of the Maze procedure
for intermediate and junior high school students

Assuming that reading primarily involves the construction of meaning from
print language, the valid assessment of comprehension is crucial for
appropriate reading instruction. In spite of an obvious need, there presently
appears to be no practical, reliable, and valid procedure available which can
be used by a teacher to assess comprehension during instruction.
To be of practical value, a test should have the following attributes: (a) The
format should be uncomplicated. (b) The test should be easily administered to
groups, requiring a reasonable amount of time to give. (c) Scoring should be
objective. (d) The test should result in the same outcomes no matter who
constructs it. Test reliability must exist both in terms of consistency among items
within a test and comparability of data obtained from tests constructed on
parallel passages. Validity exists if the test data can be appropriately used for
decision making in the instructional setting.

One informal means of assessing reading comprehension that appears to be
practical for classroom use is the maze procedure, a modification of the cloze
procedure (Guthrie, 1973). The maze procedure utilizes multiple-choice items
while the cloze procedure utilizes completion items. Guthrie recommends three
options per item: (a) the correct word, (b) a syntactically incorrect word, and
(c) a syntactically correct but semantically incorrect word.

The length suggested for a maze test passage is 140 to 160 words, with
every fifth to every tenth word replaced with a maze multiple-choice item.
According to Guthrie, Seifert, Burnham and Caplan (1974), a maze test can be
administered easily to a group in ten to fifteen minutes and allows for
objective scoring. In the same paper, they also suggested that teachers can be
easily trained to construct valid maze tests.

Some evidence concerning the reliability and validity of the maze procedure is
currently available. Gallant (1965) found a maze type test to have high internal
consistency and adequate concurrent validity for a sample of first grade students.
Guthrie (1973) administered maze tests to 12 primary grade and 24 intermediate
grade students and obtained high internal consistency and acceptable concurrent
validity. Bradley, Ackerson and Ames (in press) found adequate maze procedure
parallel form reliability with a sample of 246 second grade students. Other than
limited preliminary evidence offered by Guthrie (1973), no documented evidence
exists concerning the reliability and validity of the maze procedure for use
with intermediate and junior high school students.

Standards of mastery are also an important aspect of an informal reading
test. Guthrie's suggested maze test performance standards for the instructional
reading level are 50 to 85 percent accuracy, although no evidence was offered

The maze procedure appears to hold promise as an informal reading
comprehension test for use to initiate instructional placement and monitor progress.
The present study was undertaken for the following purposes: (1) The reliability
of maze procedure forms administered to intermediate and junior high school
students was studied. It was hoped that the present study would yield maze
forms sufficiently reliable to make possible future research into maze procedure
validity. (2) Placement decisions resulting from maze procedures were compared
to placement decisions resulting from cloze procedures on common passages.
The cloze procedure was selected as a measure for comparison with maze
because cloze instructional placement standards have been empirically derived
(Bormuth, 1967, 1968; Ransom, 1968) and the cloze procedure appears to
provide a valid measure of reading comprehension (Bormuth, 1963, 1969a,
1969b; Gallant, 1965; Jenkinson, 1957; Rankin, 1957; Taylor, 1956, 1957).
(3) An item analysis was conducted to investigate the adequacy of Guthrie's
recommended maze test option types. (4) The consistency of cloze scores and
maze scores across parallel passages was determined.

Method

Materials

Six reading passages, each consisting of at least 300 words of running
text, were selected for use to construct the maze and cloze tests. Two different
passages (Passage A and Passage B) were selected for use at each of three
grade levels (fourth, sixth, and eighth). The passages used to construct the
fourth and sixth grade tests were taken from the fourth and sixth readers of
the Houghton Mifflin Reading Program (Durr, Windley & McGourt, 1971). The
two passages selected for use at eighth grade were taken from appropriate
juvenile reading sources. The Fry Readability Graph (Fry, 1968) was also used
to aid in the selection process.

The following procedures were employed separately for each of the six test
passages. The first and last sentences in each passage were left intact. The
words in each passage identified for use as items were selected using an
every-seventh-word deletion pattern. The first word to be deleted was selected
using random procedures. A maze test was constructed by replacing each
identified word with a maze multiple-choice item. A cloze test was constructed by
deleting each identified word and replacing it with a blank of fifteen spaces. This
procedure resulted in two 40-item maze forms (Maze A and Maze B) and two
40-item cloze forms (Cloze A and Cloze B) prepared for each of three grade levels
(fourth, sixth, and eighth grade). Exact word scoring was used with the cloze
test.
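
The cloze construction steps above can be sketched in a few lines. This is a simplified illustration, not the authors' procedure in detail: the function name is hypothetical, sentences are split naively on end punctuation, the blank is drawn as fifteen underscores, and any punctuation attached to a deleted word is deleted with it.

```python
import random
import re

BLANK = "_" * 15  # stands in for the blank of fifteen spaces


def make_cloze(passage, n=7, rng=None):
    """Build an every-nth-word cloze test from a passage.

    The first and last sentences are left intact; the first deleted word
    is chosen at random among the first n words of the remaining text.
    Returns the cloze text and the list of deleted words (answer key).
    """
    sentences = re.split(r"(?<=[.!?])\s+", passage.strip())
    if len(sentences) < 3:
        raise ValueError("need at least three sentences")
    first, middle, last = sentences[0], sentences[1:-1], sentences[-1]
    words = " ".join(middle).split()
    rng = rng or random.Random()
    start = rng.randrange(n)  # random starting point for the pattern
    answers = []
    for i in range(start, len(words), n):
        answers.append(words[i])
        words[i] = BLANK
    return " ".join([first] + words + [last]), answers


# Hypothetical passage used only to show the mechanics.
sample = (
    "The river ran past the old mill. "
    "Every spring the water rose and covered the low fields near the "
    "village, and the farmers moved their animals to higher ground. "
    "By summer the fields were dry again."
)
cloze_text, key = make_cloze(sample, rng=random.Random(1))
print(cloze_text)
print(key)
```

A maze form could be derived from the same deleted positions by substituting a set of options for each blank, which is how the parallel maze and cloze forms in this study share items.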

Subjects

The sample consisted of 335 fourth, sixth, and eighth grade students attending
four schools in Tucson, Arizona. The sample consisted almost entirely of
Anglo, suburban-dwelling students from middle income families.

Procedure

During May 1977, the maze and cloze tests were administered to the sample.
Intact classes were randomly assigned to either procedure. A short training
session was provided to familiarize the students with the purpose and format of
the testing procedure (either maze or cloze). Each student received two forms
to complete, one form for each test passage (Form A and Form B).
Half the students first completed the test forms for Passage A while the other
half first completed those for Passage B. The particular form (A or B) given
first to a student was determined randomly. The students in the maze groups
were given directions consistent with those suggested by Guthrie, Seifert,
Burnham and Caplan (1974). Those in the cloze groups were directed to
"Fill in each cloze blank with the word that makes the best sense in terms of
the story."

Results and Discussion

Table 1 reports numbers of subjects, means, standard deviations, and
Cronbach Alphas for each form of maze and cloze for each grade level.

Table 1
Means, Standard Deviations and Alpha Coefficients
for Cloze and Maze Passages

Grade Level          N                 Mean                    SD            Alpha
and Form        Cloze  Maze     Cloze        Maze         Cloze  Maze    Cloze  Maze

Fourth Grade
  Form A          56    59    17.7 (44%)   37.4 (94%)      4.8    2.6     .76    .71
  Form B          56    59    15.0 (38%)   33.1 (83%)      4.6    3.7     .75    .73

Sixth Grade
  Form A          57    48    10.2 (26%)   34.8 (87%)      3.5    's.r    .65    .74
  Form B          57    48    12.6 (32%)   35.6 (89%)      A 4            .74    .85

Eighth Grade
  Form A          49    66    16.1 (40%)   36.3 (91%)      5.5    3.8     .80    .80
  Form B          49    66    18.5 (46%)   38.5 (96%)      6.8    1.8     .87    .62



Constructing maze tests using Guthrie's recommendations resulted in tests
which were considerably easier than corresponding cloze tests. Every maze test
had a ceiling effect where scores were approaching the upper limit of the score
range. Contrasted to this are the cloze passage means which were at the
opposite end of the difficulty continuum.

Maze and cloze scores converted to percentages were interpreted in terms
of available functional reading level criteria for the maze procedure (Guthrie et al.,
1974) and the cloze procedure (Bormuth, 1967; Rankin & Culhane, 1969).

The maze mean scores on five of the six forms ranged from 87 to 96
percent, identifying students at the independent reading level. The one exception
was the mean score obtained with Maze Form B at the fourth grade level.
That mean score (83 percent) fell near the upper limits of the instructional

Different conclusions concerning student placement were generated by the
cloze procedure results. Cloze forms at the fourth and eighth grade levels
placed students at the instructional level (38 to 46 percent) while cloze forms
at the sixth grade level placed students at the frustrational level (26 to 32 percent).
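
The maze placement decisions described above can be expressed as a simple rule. The sketch below uses the 50 to 85 percent instructional range cited earlier for the maze procedure; treating scores above 85 percent as independent follows the text, while labeling scores below 50 percent as frustrational is an assumption made here for completeness. The function name is hypothetical, and the cloze criteria are not reproduced because their cutoffs are not stated in this paper.

```python
def maze_reading_level(percent_correct, lower=50, upper=85):
    """Classify a maze percentage score into a functional reading level.

    lower/upper bound the instructional range (50 to 85 percent).
    Scores below the range are labeled frustrational by assumption.
    """
    if percent_correct > upper:
        return "independent"
    if percent_correct >= lower:
        return "instructional"
    return "frustrational"


# Maze mean percentages from Table 1, by grade and form.
maze_means = {
    ("Fourth", "A"): 94,
    ("Fourth", "B"): 83,
    ("Sixth", "A"): 87,
    ("Sixth", "B"): 89,
    ("Eighth", "A"): 91,
    ("Eighth", "B"): 96,
}

for (grade, form), pct in maze_means.items():
    print(f"{grade} grade, Form {form}: {pct}% -> {maze_reading_level(pct)}")
```

Applied to the tabled means, the rule places five of the six maze forms at the independent level and only fourth grade Form B at the instructional level.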

The ease of the maze task resulted in restricted variability in Form A, grade
four and Form B, grade eight. Variability was most restricted in the cloze
scores at the sixth grade level because of the difficulty of the task.
The alpha coefficients were found to be directly related to the ceiling effect
obtained with the maze tests and to the floor effect obtained on the sixth grade
cloze tests. To the degree that variability was restricted, internal consistency was
lowered. The two lowest indices within the maze tests (Maze Form A, fourth
grade and Maze Form B, eighth grade) were where the ceiling effect was most
pronounced. The two lowest indices within cloze tests were at sixth grade where
the floor effect was most pronounced and variability was most restricted.

As indicated by the high maze test means, maze procedure items were very
easy. Therefore the test did not discriminate among subjects. Options must
be redesigned for these age levels in order for the maze procedure to become
a viable testing procedure. Data from this study provided two clues for option
redesign.

1. Incorrect options were not as attractive as would be required for an
adequate testing procedure. Forty to sixty percent of the subjects should
respond to incorrect options for an item to have maximum efficiency as a
discriminator (Thorndike, 1971). Recommended maze option distractors
did not meet this criterion, as an average of only 17% of the responses were
drawn to the distractors.

2. The syntactically correct, semantically incorrect option was found to be a
more effective distractor than the syntactically incorrect option.
Approximately two-thirds of the incorrect responses were of the syntactically
correct, semantically incorrect type.

Parallel form reliability coefficients were computed for grades four, six, and
eight for cloze (.65, .51, and .70 respectively) and maze (.58, .59, and .50
respectively). These reliability coefficients were found to be directly related to
internal consistency coefficients which, in turn, were related to restricted
variability. Differences in parallel form reliability coefficients can be, to a large
extent, explained by the restricted score range.

Conclusions and Recommendations 

The maze procedure as proposed by Guthrie appears to be inappropriate for
use with intermediate and junior high school students. Maze procedures resulted
in student placement predominantly at the independent level while cloze
procedures constructed on the same passages resulted in student placement at the
instructional or frustration level. The use of the maze procedure produced a ceiling
effect which reduced score variability, parallel form reliability, and therefore
the test's ability to discriminate.

It is possible that with certain modifications, the maze procedure could be
made appropriate for use with intermediate and junior high school students.
Possible maze procedure modifications might include: (a) discarding the option
type utilizing a syntactically incorrect word, (b) devising new option types, e.g.,
semantically correct within sentence but semantically incorrect within passage,
(c) increasing the number of options per item.

The use of an unobtrusive screening device to
approximate reading levels of adults

The need for a minimal-threat reading evaluation instrument has been
recognized by adult educators for some time (Fay, 1966; Otto & Ford, 1967;
Bowren & Zintz, 1977). In recent years, there have been several attempts to
devise such an instrument (Smith, Cook, Donaldson, Morrow, Stayman &
Bracitmuellar, 1965; White & Kirkland, 1971; Leibert, 1972; Rakes, 1973;
Brown, 1974). However, the majority of these attempts have utilized formal
testing situations and required the adult student to read orally.
In order to reduce the possibility of interference from test anxiety, an
unobtrusive screening device (Webb, Campbell, Swartz, & Sechrest, 1972;
Farr, 1970) was developed for use within an adult education center. The
device was designed as a student survey and used to approximate reading
levels without formal testing.

Scores were expressed in grade levels 0-9, with the 9 level indicating a
functional adult reader. Specific diagnosis was not the intent of this "testing,"
but rather a screening process to identify reading levels of entering students.

The device, the West Informal Reading Evaluation (WIRE) (West, 1977), was
disguised as a student survey and presented in three brief, separate passages
written in relation to the adult school and student classes. The passages were
written at the 3, 6, and 9 grade levels and arranged in ascending order of
difficulty. Readability was determined with the use of the Maginnis (1969)
adapted formulation of the Fry Readability Graph (1968). Content of the
passages elicited a brief written response (Betts, 1957) from the student,
which served as the basis for evaluation.

Method

Subjects

One hundred fifty-four adults, including both Basic Education students and
Adult High School Completion students, completed Form 1 of the WIRE.
One hundred twenty-three of the same students were then tested using the
Reading for Understanding Test (RFU) (Thurstone, 1963). The RFU was utilized
as the validity criterion and was one of the instruments used in this particular
center. One hundred of the original 154 students, selected at random, were
asked to complete a parallel form of the WIRE, the WIRE-2, in order to determine
reliability. Students ranged in age from 16 to 70, with the average age being
Jo years. Black and white Americans constituted the largest portion of
students, with only a small segment of foreign students represented. The
approximate distribution of males to females was 40% to 60%, respectively.

Procedure

The WIRE-1 was group administered to the students by their classroom
teachers. The instrument was distributed as a student survey, and all students
were encouraged to share their individual ideas, views, and opinions.
Distribution, completion, and collection of the WIRE took approximately 15 to
20 minutes, with a class discussion of the passage topics following the
collection of response sheets. The discussion was included in order that all
students would be able to express their views whether or not they were able to
read all of the passages. Students were told that the survey was also a means
of estimating how well they read and that this would enable teachers to select
materials, plan classes, and help students to register more accurately.

Due to the unobtrusive nature of the instrument, it was anticipated that
some adult students would object to being tested without giving their consent
[...] and most students were curious as to how well they had scored.

WIRE scores were based upon the correctness, feasibility, and
completeness of the student's response to each passage. Estimated reading
levels were indicated by scores described in the following scale:



 0   no apparent reading ability
-3   below a 3rd grade level reading ability
 3   approximately a 3rd grade reading ability
 3+  better than a 3rd grade level, but not more than a high 4th grade level
-6   below a 6th grade level, but not lower than a high 4th grade level
 6   approximately a 6th grade level ability
 6+  better than a 6th grade level, but not more than a high 7th grade level
-9   lower than a 9th grade level ability, but not lower than a high 7th grade level
 9   approximately a 9th grade level ability
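
The scale above can be sketched as an ordered lookup, which may help when scores from several raters have to be compared or sorted. This is only an illustrative sketch: the label spellings ("-3", "3+", and so on) and the helper names `rank` and `describe` are assumptions, not part of the WIRE itself.

```python
# Ordered from lowest to highest estimated reading level, following the scale.
# Label spellings are assumed for illustration.
WIRE_SCALE = [
    ("0", "no apparent reading ability"),
    ("-3", "below a 3rd grade level"),
    ("3", "approximately a 3rd grade level"),
    ("3+", "better than 3rd grade, not above a high 4th grade level"),
    ("-6", "below 6th grade, not below a high 4th grade level"),
    ("6", "approximately a 6th grade level"),
    ("6+", "better than 6th grade, not above a high 7th grade level"),
    ("-9", "below 9th grade, not below a high 7th grade level"),
    ("9", "approximately a 9th grade level"),
]

LABELS = [label for label, _ in WIRE_SCALE]
DESCRIPTIONS = dict(WIRE_SCALE)


def rank(label):
    """Return the position of a score label on the scale (0 is lowest)."""
    return LABELS.index(label)


def describe(label):
    """Return the verbal description attached to a score label."""
    return DESCRIPTIONS[label]


# Example: a "6+" response ranks just below a "-9" response.
print(rank("6+") < rank("-9"), describe("-9"))
```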

Occasionally readers responded correctly and exceptionally well at the
level 6 passage, but entered a "no comment" at the level 9 passage; a score
of 6+ seemed to be a low estimate and a -9 possibly a closer approximation
of the reader's true ability. Judgment in borderline situations was enhanced
with scoring practice and was not as intuitive or ambiguous as it would appear.

Results

Validity coefficients of .868 and .818, respectively, indicated that the
WIRE-1 and WIRE-2 were both valid measures of adult silent reading
comprehension as related to the RFU. Reliability of the WIRE scores was
determined at the .847 level, based on the correlation of 100 paired WIRE
scores.

In order to determine the inter-rater reliability of the WIRE, two teachers
(one trained, the other untrained in scoring the WIRE) rated 40 randomly
selected surveys. Their ratings were correlated with the ratings of the
researcher and resulted in coefficients ranging from .823 to .949, which were
significant at the .01 level. Table 1 lists the inter-rater means, standard
deviations, and range of scores.
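
The paper does not say which correlation coefficient was used. As a minimal sketch, assuming a Pearson product-moment correlation between paired scores (for example, the two WIRE forms, or two raters scoring the same surveys), the computation might look like the following. The score lists are hypothetical and are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two paired score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two paired lists with at least two scores each")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list has no variance")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired scores for six students on two parallel forms.
form_1 = [3, 6, 9, 6, 3, 9]
form_2 = [3, 6, 9, 9, 3, 6]
print(round(pearson_r(form_1, form_2), 3))
```

The same function applies to inter-rater agreement by passing one rater's scores as `x` and the researcher's scores as `y`.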

The passages of the WIRE, a total of 6 for both forms, also elicited important
data other than approximate reading levels. Subjective responses provided
information that characterized a specific adult education population and lent
insight into adult students in general. Areas of inquiry included means of
transportation used by the student, educational preferences, vocational goals,
and curricular needs as related to career plans.

Discussion 

One of the advantages of using an informal method for assessing reading
levels was that it could be developed and tailored to fit a specific population.



Table 1

Means, Standard Deviations and Range of Scores of Raters on
Forty WIRE-2 Tests

Rater             Mean    Standard Deviation    Range of Scores

1 (Researcher)            1 .HO
2 (Trained)       8.00    1.33                  4.0-10.0
3 (Untrained)     7.85    1.73                  3.0-10.0





The WIRE was one form of an instrument that could be teacher constructed
and adapted to any program or locale. With intervals of 0, 3, 6, and 9, the WIRE
was not sensitive enough to distinguish small increments of growth within
short periods of time. However, with some adult students who progress rapidly,
it might be possible to use a form of the WIRE every three to four months. The
WIRE also has potential for assessment of writing, spelling, and grammar.
However, much more would be needed in the way of effective scoring
procedures for accurate analyses.

The use of an unobtrusive screening device, such as the WIRE, would be a
feasible method for use with adult students, children, and adolescents.

Future shock and the aged:
is reading a cure or part of the problem?

Future shock, Alvin Toffler's term for culture shock in one's own society, is
"the shattering stress and disorientation induced in individuals by subjecting
them to too much change in too short a time." This rapid rate of change has
left all members of society with the problem of adaptability. For the elderly, the
shock is in many ways greater than for younger people because they have
been witness to many more changes than the younger generations. In
addition, time seems to pass more quickly for the elderly, and the ability, or
perhaps the desire, to adjust to change wanes. For the elderly who are moving
from a position as functioning members of society to retirement, adaptation
problems may be magnified as they struggle to adapt to both a new lifestyle
and a fast-moving society. These things, taken together, magnify the effects of
future shock on the elderly. Withdrawal, disorientation, and depression are the
effects.

While reading is no elixir, it can serve to assuage the adaptation problems of
the elderly. Reading can serve to increase consumer awareness among the
elderly. Recreational reading can be a method of passing the empty hours of
retirement. Paradoxically, however, the very products available for reading
contribute in some ways to the adaptation problems of the aged.

Reading as a Cure 

Reading Provides Consumer Information
In their adaptation to retirement, the elderly are faced with all types of
problems, and the specific ones facing each person depend on the level of
sophistication with which he reads. One-fifth of all people over 65 are
functional illiterates (Eklund, 1969). This group of aged people suffers the
greatest adaptation problems in retirement, for the very business of life
requires that one be able to read. The very basis of getting an income in
retirement (Social Security, Medicare, and insurance forms) revolves around
reading, and being able to read with a fairly high level of proficiency. More
courses for the illiterate elderly should be established so that they can function
effectively in this highly literate society.
For the average elderly person, reading offers a means for getting
consumer information. Haworth (1955) found that the elderly expressed an
interest in adult education courses that focused on consumer information. Yet
few courses are available that focus directly on the needs and interests of the
aged or that meet during the day when they would feel free to attend.

A small and very select group of the elderly are reading consumers in a very
special way. They use the free tuition benefits now available in most states
and take college courses. This type of program, which uses reading to provide
knowledge for knowledge's sake, appeals to only a small proportion of the
elderly, and yet it is one of the few services provided uniformly by all states to
the elderly. Classes in basic reading instruction for the illiterate and consumer
education for the average elderly should receive more emphasis. Haworth
(1955) found 70% of the aged expressed educational needs. With such a large
expressed need, certainly more can be done.

Reading as a Leisure Activity

Aside from serving an important function as an information service, reading
also aids the adaptation of the elderly by serving as a leisure time activity, both
as a solitary activity and as a means for group activity.

The elderly, retired from the working world and suffering some amount of
physical disability, have many adjustment problems. Retirees often find
themselves leading a secluded life. Inactivity breeds withdrawal, which
gerontologists call disengagement. Reading is seen as an alternative, a way to
keep the mind active and pass hours alone (Cassata, 1967).
Reading groups can serve as a means for staying active. The time spent
interacting with others who share an interest in reading can be enjoyable as
well as therapeutic. Lovelace (1977) conducted a reading group for residents
of a nursing home and found the sessions to be good therapy for participants.
Not all the participants were literate, but all viewed the group as a means of
remaining active and as something to look forward to. Wilson (1977)
conducted a group in a retirement center, comprised of healthy and avid
readers. Her findings are similar. The group provided a scheduled weekly
activity for the elderly. Reading was actually the springboard for a group social
hour, since no reading was done in the meetings, and little reading was
discussed. Reading served as the common interest to gather people for social
interaction.

So, reading as a leisure activity serves as a useful tool to help overcome
adaptation problems. Whether one is reading alone to pass empty hours or
participating in a group and interacting with others, reading is a possible
answer to adaptation problems for the elderly.

Reading as Part of the Problem 

Unfortunately, the means for a solution serve as part of the problem, also.
Written materials are being produced at an astounding rate in this country. Yet
the rate of production and the products marketed raise three problems for the
elderly.

The Cost of Materials

First, the cost of publishing is skyrocketing, leading to a substantial
increase in the price of written materials. For the elderly who have limited
incomes, the prices may make owning books prohibitive. Avid readers find a
means to get reading material, usually by swapping books with friends or
children. But the cost of magazine and newspaper subscriptions makes these
materials scarce in the homes of even the most avid readers. New library
services such as residential bookmobiles, retirement center libraries, and the
free materials for the disabled reader from the Library of Congress are an
attempt to make reading materials more readily available. But are they
enough?

Available Materials 

Second, one must examine the reading materials that are available and ask
if they are of interest to the elderly. The National Survey of Library Services to
the Aging (1971) found that most library services focus on children, while less
than 2% of all funds are spent on the elderly. The American Library
Association publishes only one booklist, Paths to a Long Life, that lists books
of interest to the elderly. An examination of this list shows no pleasure listings.
All books on the list provide consumer information, which is certainly needed,
but is not enough in and of itself.

Schramm (1969) found that the elderly spend 45% of their time in media
activities. Yet the median number of hours spent in reading was one hour a
day, or about 7% of their total time. Why so little time in reading? Part of the
problem may lie in the fact that available materials do not appeal to the elderly,
a paradox in a society that is producing written materials faster than ever
before in history.

Kingston (1977) also suggests that the nature of materials available might
be responsible for the small amount of time the elderly spend in reading. Few
magazines, e.g., National Geographic and Reader's Digest, appeal to the
elderly. Elderly men cannot identify with the fictional male protagonist who is
young and virile. Elderly women prefer light romances with no sex and dislike
mystery and science fiction (Romani, 1970). One must ask if these types of
materials are readily available. A quick look at the book racks in stores and the
fact that U.S. libraries spend so little money on the elderly lead one to suspect
the answer is "no."

Vocabulary Change

Third, reading presents another, quite different problem exemplifying some
of the problems of future shock. This is a basic comprehension problem
brought on by the fast change in vocabulary. Terms like fastback, wash and
wear, flashcube, and intercom appear as new words in reading material.
Reading is impeded by new vocabulary, resulting in basic comprehension
problems.

While conducting the reading group in Athens, Georgia, this writer was
contacted by an elderly woman. An avid reader, she was to read a magazine
article and report on it for her mission society. She felt defeated because the
article, which was about her church's new evangelistic mission program
beaming radio programs via Telstar to Iron Curtain countries, made no sense
to her. A quick explanation of the electronic concepts involved in the project
easily remedied the comprehension problem, grounded in lack of background
experience and vocabulary. This is just one example of what many elderly
people who are good readers face when they find themselves bogged down in
Time Magazine, the newspaper, or even the church missionary magazine.

Where to Go from Here

Adaptation problems exist for the elderly and arise from two sources. First,
the shift from being employed and raising a family to retirement causes
problems. Second, this change in lifestyle is aggravated by the fast pace and
constant change, the future shock, in society. Both factors contribute to the
withdrawal and adaptation problems of the elderly.

Reading offers solutions. It can be a means of disseminating information.
However, society has fallen short in providing many sorely needed consumer
services or basic education in living for the elderly. Reading also offers a
leisure activity. It is a viable means for dealing with both future shock and
disengagement in the aged, serving as a solitary activity and as a means for
group interaction. Here, again, while inroads are being made through some
few agencies, much work has not even begun. The services of the Library of
Congress need to be advertised. More reading groups in nursing homes and
retirement centers must be established. And certainly a society that produces
so many written materials can publish books of interest to the elderly.

Reading is offered as at least a partial solution for the adaptation problems
of the elderly. It is time to begin making headway in using reading as a method
of helping the elderly adjust to a new lifestyle in a rapidly changing world.

Problems in the measurement of intellectual and linguistic
capacities, including reading, in the aged

Experimentation designed to investigate the hypothesis that mental capacity
declines with aging has proven troublesome to investigators, many of whom
encounter the following pernicious factors:

1) physiological and/or psychological health of subjects;

2) cautiousness;

3) errors in fundamental experimental design.

Physiological and/or Psychological Health

Is aging a clearly definable series of biological and social events, or is it a
process that can be discussed only metaphorically? Not only are there no
mechanical, electrical, or chemical processes that parallel aging, but, to
complicate the problem, humans have four brains (neuronal, glial, biochemical,
and vascular), each with its own genetic history, each vulnerable to specific
stresses, and each of which may age at a different rate. As a rule,
biological characteristics show earlier maturation and decline than psychological
capacities and functions. The pattern of age decrement is unique for each
person, with rate of decline varying from one function to another, and from one
individual to another (Verwoerdt, 1973). There is difficulty distinguishing
clearly between biological aging, pathological aging, and disease.

Nevertheless, there are some developmental truisms associated with aging.
Increasing age generally is accompanied by a loss in sensory acuity, resulting
in information being received incorrectly or partially without the subject being
aware of it. Perceptions of changes in body position and orientation in space
become less accurate with age. A loss of strength and agility is noted. Reaction
time begins slowing shortly after age 20; movement time and spontaneous
speech, however, do not decline until after age 70. Unfamiliarity and/or
extraneous stimulation slow down the learning of new tasks. Pressure for speed
results in the slowing of a task and in a higher error rate (Carp, 1973).

However, before these developmental truisms are applied to all elderly
persons, researchers should consider the major conclusion reached by Goldstein
and Shelly (1975), who examined the similarities and differences between
psychological deficit in aging and brain damage. They suggest that one cannot
speak of mental decline with aging in a global way, and that if trends
toward declines with age do exist, the conclusion still cannot be drawn that
a decline with age exists in all intellectual functions.

The future improvement of measurement of intellectual functioning among the
aged will be dependent to a large extent upon investigators' abilities to separate
contextual effects from essentially age-related changes, i.e., [...]
psychological changes. One possibility for more closely defining age of persons
is the Halstead battery of neuropsychological tests, held to be reliable and
valid in defining the state of higher-brain functioning independently of calendar
age, sex, [...] education, ethnic origin, occupational history, and psychometric
intelligence (I.Q.). The nonverbal tests attempt to determine the brain's ability
to integrate in time and space information supplied to it by vision, hearing,
and touch (Halstead, 1963).

The loss of perceptual acuteness may have led researchers to erroneous
generalizations as to the loss of cognitive abilities among the aged. Meyerson
(1976) suggests that, with the exception of the common finding of presbycusis
(deterioration in hearing), the processes of communication do not seem to be
seriously affected by normal aging changes. This theory is supported by
Granick, Kleban, and Weiss (1976), who investigated the relationships between
mild hearing loss and cognitive functioning for two independently selected
samples of aged persons whose hearing was within normal limits. The findings
imply that aged persons may be more intellectually capable than their test
performances suggest, and that hearing is an important variable to be considered
in the assessment of their cognitive functioning.

The possibility that reduced effectiveness in cognitive functioning is associated
with reductions in auditory acuity in persons who can hear and respond
appropriately to normal speech raises significant questions about the reliability,
validity, and meaning of much of the currently available gerontological literature.
It is unusual, for example, for the hearing status of aged persons to be reported
in descriptions of learning experiments, despite the fact that the stimulus
materials are often orally presented. The same holds for other types of
investigations which deal with intellectual factors, reaction time, and personality
functioning wherein auditory perception and discrimination are involved.

Maddox and Douglas (1974) suggest that dispersion of scores on a variety
of physiological indicators increases over time for a cohort of older individuals.
One of the most severe aging effects is labeled terminal drop. Terminal drop
(rapid change) in social, psychological, and physiological functioning immediately
prior to death has been suggested by several writers as a possible source of
individual differences in late life (see Riegel & Riegel, 1972, for further
citations). In studying elderly persons, it is argued, investigators are in fact
studying samples which include both elite survivors and an undetermined
proportion of individuals experiencing rapid decline in functional capacity prior
to death. Carp (1973) supports this argument in regard to I.Q. scores, which
are known to drop with serious illness. In some sense, the decline in I.Q. with
age is an artifact; if those people who are within a year of death are
eliminated from the average, intelligence test scores do not change significantly.
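
The arithmetic behind the terminal drop argument can be sketched with a toy cohort. All scores below are hypothetical and chosen only to show the mechanism: a group mean can fall even when every survivor's score is unchanged, simply because a few members are in rapid decline before death.

```python
from statistics import mean

# Hypothetical cohort: (test score, within a year of death?)
cohort_at_60 = [(100, False), (102, False), (98, False), (100, False)]
cohort_at_80 = [(100, False), (102, False), (98, False), (80, True)]


def mean_score(cohort, exclude_terminal=False):
    """Mean score, optionally leaving out members within a year of death."""
    scores = [
        score
        for score, terminal in cohort
        if not (exclude_terminal and terminal)
    ]
    return mean(scores)


print(mean_score(cohort_at_60))                         # all members at 60
print(mean_score(cohort_at_80))                         # all members at 80
print(mean_score(cohort_at_80, exclude_terminal=True))  # survivors only at 80
```

In this toy example the apparent age decline in the group mean disappears once the member in terminal decline is excluded, which is the pattern the argument describes.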

Problems in measuring cognitive and linguistic functioning are often
compounded by the gradual onset of impaired orientation, judgment, and
intellectual functioning seen in the elderly due to senile dementia. It has been
estimated that as many as 10% of the elderly demonstrate symptoms of senility,
the frequency being 20% above the age of 80 (Aire, 1973). Unfortunately, as
Haase (1971) remarks, several diseases associated with the elderly are
clinically indistinguishable from senile dementia. If researchers do indeed
attempt to match persons on the basis of health status, great care is needed
to avoid crediting the elderly with loss of function due to senility when senility is
actually not present.

Cautiousness 

Birren (1974) argues that with age the central nervous system slows in its
capacity to take in, store, and retrieve information, which presumably influences
a wide range of behaviors. But many researchers refute this theory and align
themselves with Botwinick (1969), who concluded that what appears to be a
decline in retrieval speed may be instead cautiousness on the part of the
elderly. Cautiousness in this case is defined as a hesitancy to become
involved when risk is high. An experiment involving two semantic memory
tasks designed by Eysenck (1975) revealed a differential effect of age on recall
and recognition, the older persons (12 working schoolteachers, CA = 55-65)
responding more slowly on the recognition task, but not on the recall task. The
response times of the older persons were less affected by the dominance of
the to-be-retrieved information than were those of the younger persons (12
students at the University of London, CA = 18-30). These results suggested that
persons in the older age group may have retrieved information faster than the
young persons but that they required longer to decide upon a response, i.e.,
they were more cautious. A replication of Eysenck's experiment with a larger
sample is suggested, as generalizations from his study are necessarily limited.

In an experiment involving older (CA = 69.31) and younger (CA = 21.71)
males participating in a vocabulary task involving various degrees of risk,
older males were found more cautious than young adults. The older males were
also less likely to raise their aspirations following success. No effect of
instructions on cautiousness was found for either age group (Okun & DiVesta, 1976).

In a similar study, Birkhill and Schaie (1975) investigated the effects of differential
reinforcement of cautiousness on intellectual performance in the elderly,
employing 56 females and 32 males with a mean age of 73 years. Cautiousness
was manipulated by systematically varying pre-test instruction conditions
involving the reinforcement of two levels of risk and of response omission
when taking the Primary Mental Abilities Test. Results suggest that performance
in intelligence tests is much influenced by situational variables involving
motivational components. Persons exposed to low-risk conditions performed
significantly better on the cognitive tasks than those exposed to high-risk
conditions only when they were able to exercise the option of responding or
not responding on individual test items.

Problems with Fundamental Experimental Design

Experimental behavioral studies of aging are either cross-sectional (stratified)
or longitudinal. Each design has its own strengths and weaknesses. In the
longitudinal study the individual is his own control. The cross-sectional approach
is particularly vulnerable to two factors: 1) the cryptic, or hidden, variable, and
2) sampling biases followed by overgeneralization. An example of the cryptic
variable is cited in the study by Jones and Conrad (1933), who applied a
modification of the Army Alpha examination from World War I to all adults
between the ages of twenty and sixty in a New England community. The
decrement in psychometric intelligence after thirty was attributed by the authors
to aging, but Lorge (1956) has cast doubt on the validity of this interpretation
by showing a probable inverse relation between the person's age and formal
schooling.

The cross-sectional approach is particularly susceptible to the problems of
purposive non-random sampling, wherein persons are selected by some
purposive method, i.e., availability, and are therefore subject to the biases of
personal selection. All too often, stratified cross-sectional random sampling
becomes stratified purposive sampling, e.g., 30 available college freshmen are
compared with 30 available elderly in a nursing home. Marascuilo (1971) rather
forcefully emphasizes that 1) currently employed statistical methods are dependent
upon random sampling and the assumptions underlying this type of sampling, and
2) if the assumptions of this sampling procedure are not met, many
statistical analyses cannot be used.
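
The contrast between stratified random sampling and purposive sampling can be sketched in a few lines. The following is a minimal illustration, assuming a complete sampling frame is available; the frame, the age bands, and the function names are hypothetical and are not drawn from any study cited here.

```python
import random


def stratified_random_sample(population, stratum_of, per_stratum, seed=None):
    """Draw the same number of members at random from every stratum."""
    rng = random.Random(seed)
    strata = {}
    for person in population:
        strata.setdefault(stratum_of(person), []).append(person)
    sample = {}
    for name, members in strata.items():
        if len(members) < per_stratum:
            raise ValueError(f"stratum {name!r} has too few members")
        # Every member of the stratum has the same chance of selection.
        sample[name] = rng.sample(members, per_stratum)
    return sample


def age_band(person):
    """Assign a (hypothetical) age stratum to an (id, age) record."""
    _, age = person
    if age < 40:
        return "young"
    if age < 65:
        return "middle"
    return "old"


# Hypothetical sampling frame of (id, age) records, ages 18 through 89.
frame = [(i, age) for i, age in enumerate(range(18, 90))]
sample = stratified_random_sample(frame, age_band, per_stratum=10, seed=1)
print({name: len(members) for name, members in sample.items()})
```

A purposive sample, by contrast, would take whichever 10 members of each band happened to be available, so selection chances would be unknown and unequal.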

Abrahams, Hoyer, Elias, and Bradigan (1975) summarized psychological
research on human subjects published in the Journal of Gerontology from
1963 through 1974 along a variety of dimensions, including subject selection
procedures, sample characteristics, data analysis techniques, research design,
author characteristics, and area of psychological research. Cross-sectional design
strategies were found in 67.2% of the studies published in the 11-year
period, which augurs ill for the generalizations made from these studies.
Thirty-seven per cent of studies published during the 1969-1974 period did not
mention the health of the subjects, while only 16.4% controlled for health.

Conclusions

Apparent loss among the aged of intellectual and linguistic capacities, including
reading ability, may be due to loss of perceptual input, disease processes,
acquired cautiousness, psychological and environmental problems, and/or faulty
research design followed by overgeneralization. Suggestions for subsequent
experimentation which are designed to counterbalance these deficiencies
include:

1) obtaining, if at all possible, a stratified random sample for
cross-sectional studies;

2) specifying clearly the age characteristics of samples under investigation;

3) undertaking some form of health assessment where it is known or
suspected that disease states influence age-associated performance;

4) treating gender differences as sources of variance;

5) where appropriate, making use of multivariate procedures as protection
against chance differences resulting from application of univariate ANOVAs to
multiple dependent factors;

6) providing an estimate of the proportion of variance accounted for in the
dependent variable by the independent variable;

7) generalizing only to the specific population from which subjects are drawn.
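
One common way to estimate the proportion of variance accounted for, as called for in suggestion 6, is eta squared: the between-group sum of squares divided by the total sum of squares. The sketch below assumes a simple one-way design with scores grouped by level of the independent variable; the function name and the example data are illustrative only.

```python
def eta_squared(groups):
    """Proportion of total variance accounted for by group membership.

    `groups` is a list of score lists, one list per level of the
    independent variable (for example, one list per age group).
    """
    all_scores = [score for group in groups for score in group]
    grand_mean = sum(all_scores) / len(all_scores)
    ss_total = sum((score - grand_mean) ** 2 for score in all_scores)
    if ss_total == 0:
        raise ValueError("eta squared is undefined when all scores are equal")
    ss_between = sum(
        len(group) * (sum(group) / len(group) - grand_mean) ** 2
        for group in groups
    )
    return ss_between / ss_total


# Hypothetical scores for a younger and an older group.
younger = [24, 27, 30, 25]
older = [22, 26, 23, 21]
print(round(eta_squared([younger, older]), 3))
```

Reporting such an estimate alongside a significance test shows how much of the variation in performance is actually associated with age group, rather than only whether the group means differ.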
Intellectual/cognitive functioning, including reading ability, has continued as
the dominant area of research in gerontological psychology. This no doubt
reflects the large number of abilities and skills included by the term
intellectual/cognitive functioning, the large number of measurement instruments
available, and the large number of variables known to affect intellectual and
cognitive performance. As the data base and the level of methodological
sophistication have increased, the parameters of adult intellectual performance,
memory, and problem-solving have become more clearly delineated; but in view of
increased knowledge, there has also been a need to reexamine previous
assumptions based on limited knowledge and [...]

References

Abrahams, J. P., Hoyer, W. J., Elias, M. F., & Bradigan, B. Gerontological research in psychology
    published in the Journal of Gerontology, 1963-1974: Perspectives and progress. Journal of
    Gerontology, 1975, 30, 668-673.
Aire, T. Dementia in the elderly: Diagnosis and assessment. British Medical Journal, 1973, 4,
    540-543.
Birren, J. E. Translations in gerontology-from lab to life: Psychophysiology and speed of response.
    American Psychologist, 1974, 29, 808-815.
Birkhill, W. R., & Schaie, K. W. The effect of differential reinforcement of cautiousness in
    intellectual performance among the elderly. Journal of Gerontology, 1975, 30, 578-583.
Botwinick, J. Disinclination to venture response versus cautiousness in responding: Age differences.
    Journal of Genetic Psychology, 1969, 115, 55-62.
Carp, F. M. The psychology of aging. In R. R. Boyd & C. G. Oakes (Eds.), Foundations of
    practical gerontology. Columbia, S.C.: USC Press, 1973.
Eysenck, M. W. Retrieval from semantic memory as a function of age. Journal of Gerontology,
    1975, 30, 174-180.
Goldstein, G., & Shelly, C. H. Similarities and differences between psychological deficit in
    aging and brain damage. Journal of Gerontology, 1975, 30, 448-455.
Granick, S., Kleban, M. H., & Weiss, A. D. Relationship between hearing loss and cognition in
    normally hearing aged persons. Journal of Gerontology, 1976, 31, 434-440.
Haase, G. R. Diseases presented as dementia. In C. E. Wells (Ed.), Dementia. Philadelphia, PA:
    Davis, 1971.
Halstead, W. C., Merryman, P., & Klien, B. The Halstead index and differential aging. In R. H.
    Williams, C. Tibbitts, & W. Donahue (Eds.), Processes of aging: Social and psychological
    perspectives [...], 1963.
Maddox, G. L., & Douglas, E. B. Aging and individual differences: A longitudinal analysis of
    social, psychological, and physiological indicators. Journal of Gerontology, 1974, 29, 555-563.
Marascuilo, L. A. Statistical methods for behavioral science research. New York: McGraw-Hill, 1971.
Meyerson, M. D. The effects of aging on communication. Journal of Gerontology, 1976, 31, 29-38.
Okun, M. A., & DiVesta, F. J. Cautiousness in adulthood as a function of age and instructions.
    [...]
Riegel, K. F., & Riegel, R. M. Development, drop, and death. Developmental Psychology, 1972,
    6, 306-319.
Verwoerdt, A. Biological characteristics of the elderly. In R. R. Boyd & C. G. Oakes (Eds.),
    Foundations of practical gerontology (Rev. ed.). Columbia, S.C.: USC Press, 1973.






PATRICIA L. ANDERS
DOUGLAS CARDELL
University of Arizona



Self-perceptions and reading habits of
adolescents, college students, and adults

A popular conception exists that people could read if they chose to do so,
but that, for the most part, they do not choose to do so. Mikulecky reported
three surveys in the 1977 NRC Yearbook which bear witness to this idea: one
survey indicated that 1% of the adult population completes a book per year; a
second survey suggested that 25% of all adults read in a book per month; and
a third survey reported that 58% of the population "never read, never
finished" a book.

This assertion and these surveys provided the impetus for this study. Three
randomly selected groups were chosen: (1) adult readers; (2) college students;
and (3) junior high students. The subjects were surveyed to examine and
compare their reading behaviors within three parameters: (1) their self-perceived
reading abilities; (2) their self-reported reading habits; and (3) their
ability to answer an objective-type measure of reading retention. By examining
and comparing these three groups, the assertion that the general population
does not choose to read could be rejected or supported, at least for people
across three age groups in Tucson, Arizona.

Method

Subjects 

The adult population sample was chosen by randomly selecting 209 names
from the local telephone directory. When telephoned, 199 of these 209
potential subjects agreed to respond to a reading survey. Demographic data
reported by the 199 respondents matched, within 1%, published census data
with regard to age, income, education, and ethnicity.

The college students were surveyed with questionnaires distributed in
randomly selected liberal arts classes. Questionnaires were completed by 176
subjects. The average age of this group was 20 years, the average family
income was $15,500, and the average education was 14 years.

The junior high students were also sampled with questionnaires handed out
in school classes. Two classes were randomly selected from each of four
public Tucson junior high schools. A total of 199 students completed the
questionnaire. This sample also closely corresponded with published
demographic data.

Materials 

All subjects were asked the same questions whether on the telephone or on
a dittoed handout. The questions were organized into four different types. The
first type related to reading habits: how often newspapers, news magazines,
trade magazines, and fiction and non-fiction books were read, and frequency
of trips to the library.

The second type of questions related to the subjects' self-perceptions of
their reading abilities as compared with others about their own age. The
subjects responded to four statements by choosing one of three degree-related
statements (i.e., somewhat more, about the same, somewhat less,
etc.).

The third type of questions was designed to ascertain demographic
characteristics of the subjects: age, income, education, sex, and race were
included as demographic data.

The final category of questions was a true-false quiz to test retention of
newspaper information. Questions were chosen for their suitability from the
headlines of the two Tucson newspapers from the previous week. One or two
words from the headline were then changed, added, or omitted in such a
manner as to give the new headline an opposite or greatly altered meaning.
For example: "City council votes to raise water rates" became "City council
votes to lower water rates."

Analysis

The analysis consisted of (1) Pearson product-moment correlation
coefficients to compare variables across all three groups, (2) calculation of
each group's mean responses to each of the items, and (3) t tests for
differences (a two-tailed test) between each of the means.
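
The three analysis steps can be illustrated with a minimal sketch. The data below are invented for illustration and are not the study's responses; the function names are hypothetical, and the pooled-variance form of the t statistic is an assumption, since the text does not say which form was used.

```python
import math

def mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def pooled_t(a, b):
    """Two-sample t statistic (pooled variance) and its degrees of freedom."""
    na, nb = len(a), len(b)
    ma, mb = mean(a), mean(b)
    ssa = sum((x - ma) ** 2 for x in a)
    ssb = sum((x - mb) ** 2 for x in b)
    df = na + nb - 2
    pooled_var = (ssa + ssb) / df
    t = (ma - mb) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df

# Hypothetical library trips per month for two small groups.
college = [9, 11, 10, 12, 8]
adults = [3, 2, 4, 3, 3]
t, df = pooled_t(college, adults)
print(f"t({df}) = {t:.2f}")
```

The t value would then be compared against a two-tailed critical value for the given degrees of freedom, as in the tests reported in the Results.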

Results 

The amount of reading reported by each of these three groups was a good
deal greater than predicted (see Table 1). Library usage also appeared to be a
fairly common habit across all three groups: college students reported going
to the library an average of ten times per month, junior high students reported
an average of seven times per month, and the adult population reported an
average of three times per month. Both the college students and the junior
high students made significantly more trips to the library than did the adult
population, t(370) = 12.64, p < .01, and t(394) = 5.29, p < .01, respectively.

Table 1

Mean Number of Reading Materials Read Per Year

Group                  Newspapers   News        Trade       Fiction   Non-Fiction
                                    Magazines   Magazines
General Population     312          32J         28.8        21        15
College Students       395.2        39.6        9.6         15        19
Junior-High Students   254.8        25.2        6.0         59        35

No significant differences were found between the adult sample and the
college sample with regard to newspaper readership. However, the adult
population reported reading the newspaper significantly more than did the
junior high students, t(396) = 4.19, p < .01.

The quantity of newspaper, fiction, and non-fiction reading correlated with
each of the group's attitudes toward reading (see Table 2).

Table 2

Pearson Product Moment Correlations Between Attitudes
and Type of Reading

Attitude        Group         n     Newspaper     Fiction       Non-Fiction
Rate            Adults        199   -.2388***     -.2405***     -.1726**
                College       176   [illegible]   [illegible]   [illegible]
                Junior High   199   -.2966***     -.3223***     -.2835***
Comprehension   Adults        199   -.1230*       -.2474***     -.1627**
                College       176   [illegible]   [illegible]   [illegible]
                Junior High   199   [illegible]   -.0752        -.1065
Enjoyment       Adults        199   -.2405***     -.3143***     -.1647**
                College       176   -.1269*       -.1702**      -.0325
                Junior High   199   -.2713***     -.2475***     -.2784***
Amount          Adults        199   -.2288***     -.3152***     -.1543*
                College       176   -.1682**      -.2262***     -.1537*
                Junior High   199   -.1382*       -.3277***     -.3323***

*p < .05   **p < .01   ***p < .001



Furthermore, college students thought they read more than other people
their own age significantly more, t(369) = 2.53, p < .024, than did the adults.
Junior high students reported reading faster than people their own age
significantly more than did the adults, t(396) = 3.75, p < .01.

Finally, no significant differences were found between the population
groups in their scores on the quiz designed to estimate reading retention. All
groups averaged about 58% correct responses.

Conclusions

Tucsonans reported reading far more than the previously reported studies
would suggest. This result indicates that perhaps people are reading more
than has been previously thought.

Also, across all age groups, the subjects' self-perceptions about reading
and their reported reading behaviors were very similar. This tends to support
general folkloric notions that "either people do well at what they enjoy or they
enjoy what they do well," even to the point of intellectual behaviors such as
reading.

References 

Arizona Statistical Review (32nd ed.). Valley National Bank, Tucson, Arizona, 1976.
Mikulecky, L. J., & Wolf, A. Effect of uninterrupted sustained silent reading and of reading
games on changes in secondary student's reading attitudes. In P. D. Pearson (Ed.), Reading:
Theory, research and practice (Twenty-sixth yearbook of the National Reading Conference).
Clemson, South Carolina: NRC, 1977.




GEORGE W. McCONKIE
Cornell University

Stimulus control in the study of reading

The act of reading is a very complex process, as many researchers have
noted, and can be studied at a number of different levels. The purpose of this
paper is to describe a research approach being taken at Cornell University to
study reading at the micro-level. This project is aimed at gaining an
understanding of the perceptual and cognitive events occurring during a
fixation and across successive fixations during reading by relatively skilled
readers. This paper will describe the technology which has been developed for
the research, pose a number of questions about what is taking place during
reading, give examples of studies bearing on these questions, and discuss the
potential value of the research approach being taken.

Technology

Over the past several years an eye-movement contingent display control
system has been developed for the study of reading. This is a computer-based
system which monitors a reader's eye movements as he is reading from text
displayed on a cathode-ray tube (CRT), and which is capable of making
changes in the text contingent upon characteristics of his eye movements.
The details of this system have been described elsewhere (McConkie, Zola,
Wolverton & Burns, in press), and will only be briefly mentioned here. The
system checks the location of the eye 1000 times per second, and keeps a
complete record of eye movements during reading. In addition, it detects
when a saccadic eye movement begins and ends, and calculates where the
eye is directed on the CRT (on what letter position the eye is centered). As the
computer obtains this eye movement information during reading, it is able to
make changes in the text while reading is in progress, based on whether the
eye is moving or still, or where the eye is directed. For instance, a change in
the text may be produced during a single eye movement so the text is different
in some specific way on one fixation than it had been on the last. Changes can
also be made during a fixation, so that a certain number of msec. after a
fixation begins one line of text is removed and quickly replaced by another.
The nature of the stimulus change can depend on where the eye is centered.
At the present time we are developing a prediction algorithm by which the
computer can calculate during an eye movement where the eye is going to
stop based on the velocity pattern of the eye, and make changes in the display
so that when the eye comes to rest at that location certain stimulus patterns
will lie at specific foveal or peripheral visual locations for that fixation. These
display changes can be produced very rapidly: replacing one 70-character line
with another requires 2 msec. The nature and use of this computer system will
become clearer as we proceed through some specific research examples.
This will be done by bringing up a series of questions about reading, and
dealing with each in turn.
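
How a computer might decide that a saccade has begun or ended can be sketched as follows. This is only an illustration: the paper does not state the detection rule used, so the velocity-threshold approach, the function name, the threshold value, and the synthetic eye-position trace are all assumptions.

```python
def detect_saccades(positions, threshold=0.02):
    """Return (start, end) sample indices of saccades in a position trace.

    positions: eye position in letter positions, one sample per msec
               (matching the 1000 samples per second mentioned in the text).
    threshold: minimum absolute change per sample that counts as movement;
               the value is an assumption chosen for this synthetic trace.
    """
    saccades = []
    start = None
    for i in range(1, len(positions)):
        moving = abs(positions[i] - positions[i - 1]) >= threshold
        if moving and start is None:
            start = i                        # first sample of the movement
        elif not moving and start is not None:
            saccades.append((start, i - 1))  # last sample that was moving
            start = None
    if start is not None:                    # trace ended during a movement
        saccades.append((start, len(positions) - 1))
    return saccades

# Synthetic trace: fixation at letter 10, a 20-msec saccade, fixation at letter 18.
trace = [10.0] * 50 + [10.0 + 0.4 * k for k in range(1, 21)] + [18.0] * 50
print(detect_saccades(trace))
```

A display change made "during a single eye movement" would be triggered between the start and end indices returned for that saccade.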
Some Examples of Research

What information is carried from one fixation to the next?

The first study to be described was carried out by Rayner (1975) several
years ago using a system which we developed at the Massachusetts Institute
of Technology (McConkie & Rayner, 1974). In the study subjects read short
paragraphs each containing a specific word location of interest, called the
critical word location. As the subjects read, an eye movement was detected
which was likely to terminate in a fixation centered on that word location. The
contents of that location (a word or a non-word letter string) were immediately
replaced by a word of the same length. Thus, the word which occupied that
location when the eye landed there was different from the word that had
occupied it on the prior fixation. If the text prior to the change read "The
robbers guarded the palace...", on the next fixation the text read "The
robbers guarded the police...". The word "palace" had been replaced by the
word "police" while the eye was moving to look at that word. Two questions of
interest are: Would a reader detect such a change, and if so, when would it be
detected? Several theories of reading suggest that visual information is
acquired and stored in some sort of buffer storage during a fixation, and only
used at a later time when needed for reading (Shebilske, 1975). If this were the
case, it might be expected that the discrepancy would not be discovered on
the fixation following the change (when the new word was being seen for the
first time) but only during a later fixation. It is of interest to find, then, that
there was a substantial increase in the duration of the fixation immediately
following the change. Thus, the discrepancy in the text pattern from one
fixation to the next was detected in some manner, and this occurred during the
fixation in which the new stimulus was first seen. It is not true that there is a
considerable lag between the input of visual information and the time it can be
involved in the reading process. It is also apparent that the information
acquired during a fixation can influence the duration of that fixation. Finally, it
is of interest that when the subject's fixation prior to the change was within
three letter positions in front of the critical word location (thus close enough
that the word could probably be read), there was no evidence on the next
fixation that the change had been detected even though the eye was centered
on or immediately following the changed word. Thus, although the word was
clearly within the visual region in which it could be read if desired, it was being
ignored. Apparently not all words within the visual span region are "seen"
during a fixation.

The Rayner study just described clearly shows that information is carried
over from one fixation to the next; otherwise the change in the text between
the two fixations would not be detected. McConkie & Zola (in preparation)
investigated what information is being carried across fixations. Rayner (1975)
suggested that there may be some integrative visual buffer, where the visual
information obtained on successive fixations is integrated into a single set of
visual information about the text. Display changes from one fixation to the next
would disrupt this integration process because visual patterns in certain
regions would not match. If this were the case, what would happen if people
read text printed in AlTeRnAtInG cAsE, and then during some saccade the
case of every letter were changed; that is, every upper case letter were
changed to lower case, and vice versa? If the visual pattern from successive
fixations is indeed integrated in some buffer, such a change should be highly
disruptive to reading. With every letter changing shape from one fixation to the
next, any attempt at integration of the two visual patterns should be
devastating. However, McConkie & Zola (in preparation) found that such
changes had no effect on reading. In fact, when subjects knew that such
changes were taking place, and tried to see them as they read, they were
unable to. Whatever information is carried over from one fixation to the next, it
is apparently not simple visual patterns or even shapes of letters or words. It
must be information at a deeper level of encoding: perhaps letters, letter
patterns, sound patterns, etc. It is also probably not only lexical meanings,
since Rayner (1975) provided evidence that display changes are detected at
points where subjects are not distinguishing between words and non-words.
The nature of the information carried over from one fixation to the next by a
skilled reader remains an interesting problem for further investigation.
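
The case-change manipulation can be made concrete with a small sketch. The helper name and the rule of alternating over letters only (skipping spaces) are assumptions made for illustration; the paper does not describe how the alternating-case text was generated.

```python
def alternating_case(text, start_upper=True):
    """Render text with successive letters alternating upper and lower case."""
    out = []
    upper = start_upper
    for ch in text:
        if ch.isalpha():
            out.append(ch.upper() if upper else ch.lower())
            upper = not upper      # flip case for the next letter
        else:
            out.append(ch)         # spaces and punctuation are left alone
    return "".join(out)

before = alternating_case("the robbers guarded the palace")
after = before.swapcase()          # display presented after the saccade

print(before)
print(after)
```

Every letter differs in shape between the two displays, yet the letter identities, and hence the words, are unchanged.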

From what region is visual information acquired during a fixation?

Turning now to another question about reading, a series of studies has been
conducted investigating the size of the perceptual span, or the region from
which visual information is acquired during a fixation in reading. In a recently
reported study (McConkie & Rayner, 1976), the text display was changed on
each fixation. When the computer detected that a fixation had begun, it
replaced all letters which lay more than a certain number of character
positions to the left and right of the fixation point with other letters. Thus, on
each fixation, a stimulus pattern was created such that normal text lay in a
specific region around the fixation point, and non-readable letter strings lay
outside this region. The following lines show how a line of text might have
looked on each fixation as a person was reading part of the line:

  I.   xxxxxxxxxxx. It appears that pitch-naxxxx xxxxxxx xxx xx
  II.  xxxxxxxxxxx. Xx xxxxxrs that pitch-naming abilxxx xxx xx
  III. xxxxxxxxxxx. Xx xxxxxxx xxat pitch-naming ability cxx xx

On each fixation, the line of text was changed so that the normal letters
were present for 4 letter positions to the left of the fixation point and a
considerable distance to the right. All letters outside this region were replaced
by other letters. The question asked in the study was this: what would happen
if the reader could not obtain useful visual information from words or letters
more than 4 letter positions to the left of the fixation point? The answer is:
nothing happens. Mutilating the text in this region seems to have no effect on
reading. However, if letters more than 4 letter positions to the right of the
fixation point are replaced by other letters, reading deteriorates dramatically.
Apparently the region from which visual information is acquired during a
fixation by skilled readers of English is highly asymmetrical, lying primarily to
the right of the center of vision. In line with a comment made earlier, this
indicates that visual information well within the retinal region from which
letters and words can be identified (Bouma, 1973) may be ignored during a
fixation in reading. Other information about the size of the perceptual span is
reported by McConkie & Rayner (1975) and by Rayner (1975).
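
The window manipulation described above can be sketched in a few lines. The function name, the default window sizes, the use of "x" as the replacement letter, and the choice to preserve spaces and punctuation are assumptions for illustration, not a description of the actual display software.

```python
def moving_window(line, fixation, left=4, right=14, filler="x"):
    """Replace letters outside [fixation - left, fixation + right] with filler.

    Spaces and punctuation are kept so that word-length patterns survive,
    and capital letters are replaced by a capital filler; both choices are
    assumptions.
    """
    lo, hi = fixation - left, fixation + right
    out = []
    for i, ch in enumerate(line):
        if lo <= i <= hi or not ch.isalpha():
            out.append(ch)                       # inside window, or not a letter
        else:
            out.append(filler.upper() if ch.isupper() else filler)
    return "".join(out)

line = "It appears that pitch-naming ability can be"
# Eye centered on character index 12, the "h" of "that".
print(moving_window(line, fixation=12))
```

Calling the function again with a new `fixation` value after each saccade would give the reader a window of normal text that moves with the eye, while an asymmetric window is obtained simply by choosing different `left` and `right` values.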

How is the eye guided?

Another question of some interest is whether the eye is sent to a specific,
preselected location for the next fixation, or whether it is simply advanced a
certain distance rather automatically (Rayner & McConkie, 1976; Rayner,
1978; Levy-Schoen & O'Regan, 1977). McConkie & Zola (in preparation)
investigated this question by changing the display during a saccade to cause
the next fixation to lie at a location different from that to which it was originally
directed. This was done by simply shifting the text two letter positions to the
right or the left during certain saccades. Thus, on the following fixations the
eye was centered on a location two letter positions away from where it would
normally have landed in the text. Two facts are apparent from this study. First,
subjects are not consciously aware that any change has been made in the
display. But second, their eye movement patterns are affected by the shift in
the text. There is a definite increase in the number of short eye movements,
around two letter positions in length, following text shifts, thus apparently
indicating some sort of correction for the erroneous eye position. This
suggests that in some way the mind is directing the eye rather specifically, and
that the eye movement system detects inaccurate positioning of the eye in
reading.

While the study of reading using eye-movement contingent display control
has only begun, the power of this approach is already becoming evident
through studies like those described above. In the few studies mentioned here,
the following facts about reading seem to be emerging:

1. While visual information about the shapes of letters and words must
be acquired during each fixation and used to support reading, such specific
visual detail is not carried across from one fixation to the next. This integration
across fixations by which peripheral visual information from one fixation can
facilitate processing on the next (Rayner, McConkie & Ehrlich, in press) must
be occurring at some deeper level of encoding.

2. During a fixation in reading, visual information is apparently derived
from specific text locations (or retinal regions) rather than from all of the text
lying within the normal perceptual span. Some words lying within the visual
field at locations where word identification is possible are ignored during a
fixation. This is particularly likely for words lying to the left of the center of
vision, from which meaning has probably already been derived during a prior
fixation.

3. When a saccade is initiated, the eye is being sent to a specific
location in the text. Landing at a position different from this can trigger
corrective eye movements of some sort.

The Potential of Research Involving Eye Movement 
Contingent Control Over the Display 

In another paper (McConkie, 1977) an attempt has been made to integrate 
these and other facts about reading into a way of conceptualizing the 
relationship between eye movements and language processing in reading. 
The point to be made here is that the problem of how to exercise precise
control over the stimulus as a person is engaged in reading has been largely
solved (although refinements are certainly needed) and that such control
opens the door to a powerful new approach to the study of reading. Many
specific questions about the perceptual and language processing occurring
during reading which were previously beyond our ability to study in a precise
way become tractable within this research context. There have been many
puzzling questions about reading which we have been unable to investigate in
a carefully controlled manner. For instance, just what aspects of the text lying
within the foveal or near-foveal region are actually being detected and used
during a fixation in reading, and how is this influenced by the linguistic context
or the reader's expectations at that point in the text? When a new fixation
begins, how does the reader know where in the large visual array he should
seek information: from which line and which region on that line? How long
does it take from the time a given region of text (say, a particular word) is
visually encountered during a fixation until the language processes have
detected or produced certain meaning relationships which involve information
from this word? Once alternative answers to such questions as these have
been stated, they can often be rephrased into sets of contrasting predictions
about what would happen if this or that display or text manipulation were
made while a person were reading. Typically these predictions indicate which
types of visual or textual manipulations would be detected or would cause
difficulty in reading and which would not, or when such difficulties would be
encountered, if the perceptual and language processing occurred in a certain
way. Research already carried out has shown that eye movement patterns
can be sensitive indicators of the effects of such manipulations on reading
(Rayner, 1975; McConkie & Zola, in preparation). Processing difficulties can
be detected as changes in the duration of a particular fixation or length of a
particular saccade, when averaged over subjects or over trials for the same
subject. This is a type of precision which we have needed but largely lacked in
our prior reading research.

The studies which have been described here all relate to the nature of 
relatively skilled reading. It should be obvious that as knowledge of skilled 
reading develops, these same techniques can be used to study the 
development of reading skill. But of even greater importance is the fact that, 
as we come to understand skilled reading better, a firmer basis will be laid for
exploring in detail the problems being encountered by less able readers. At
some point, the technology which permits such flexible control over the
stimulus during reading will become the basis for a new generation of
computerized diagnostic instruments which are capable of precisely 
specifying a large number of very particular aspects of a given person's visual 
and language processing characteristics from data collected as he is reading. 

Certainly there are other levels at which the reading process must be 
studied, such as those which will indicate what people retain from their
reading, and how this is influenced by discourse characteristics, by the nature
of the task they engaged in, and by their own prior knowledge. But for studying
the reading processes at the micro level which has been described in this 
paper, the eye movement contingent display control techniques provide a 
significant breakthrough in the development of reading research capability. 

1 The preparation of this paper has been supported in part by Grant No. MH24241 from the
National Institute of Mental Health. It was presented at the National Reading Conference, New
Orleans, Louisiana, in December, 1977. The author wishes to acknowledge the assistance of
David Zola, Gary S. Wolverton, and David D. Burns in developing the computer system described
here, and in conducting much of the research. Requests for reprints should be addressed to
George W. McConkie, Department of Education, Cornell University, Ithaca, New York, USA
14853.
Some reasons for focusing on classrooms in
reading research 

During the past decade, reading research has mushroomed in volume and
sophistication. Much of this research has been centered on what a person
must know in order to acquire information from a printed page. This has led to
a number of developments, some of them startling, in our sense of what
readers must know to process letters (Estes, 1975), words (Smith and Kleiman,
in press), sentences (Wanat, 1976), and even texts (Fredericksen, in press;
Griffin, 1977). A distant hope underlying all this research is that we might be
able to locate the gaps in what deficient readers know about how to read, and
that we might use all this research to develop a pedagogy designed to fill these
gaps.

As exciting as this progress has been, and as much as we must hope that it 
can continue, this paper represents a call for an alternative approach to the 
study of reading. Rather than focus on skills children must develop in 
achieving reading competence, this paper calls for an inquiry into one of the 
crucial environments in which children are asked to develop skills, namely,
classrooms. The effects of other environments on learning to read, for
example, homes, streets and work places, are equally important (Sticht, 1975;
Szwed, 1977), but will not be addressed in this paper. As a spur to an inquiry
into reading in classrooms, the paper claims that given some uniform
organizational constraints on most teachers and children in American
classrooms, most particularly, pressures to sort children into categories such
as successful and unsuccessful, able and disabled, smart and dumb, different
children get differential access to reading instruction and differential time on
reading tasks. The product of these organizational constraints is that many
children, although perfectly well equipped to learn how to read under other
circumstances, fail to develop the reading skills which mark a successful
school career. If this conclusion is correct, then our problem with deficient
readers is not that they cannot develop various reading skills, but that they are
not offered appropriate institutional circumstances for developing such skills.

This is a point of considerable consequence for our current strategies of 
research and pedagogy. At the present time, it is only possible to say how we 
cannot, in principle, solve reading problems with the findings of present 
research efforts. It would be nice to offer a more positive statement. Although 
there are strong signs that reading deficiencies are socially organized,
research on how this state of affairs is accomplished with each new
generation of school children is just beginning. A recent review of the
literature (Cazden, in press) correctly complains that we know little about the
conditions under which our children are asked to learn to read. We will have to
do better. 

If our goals are to understand how some people learn to read while others 
do not and then to develop an appropriate pedagogy for those who do not,
most current research strategies which emphasize the analysis of information
processing skills are inadequate. Such research may tell us about what most
readers might do while reading but little about how they develop such habits or
what would have to be changed in order for them to do so. Such research
reveals little about the lives of readers and the organization of the world which
keeps them engaged with print.

The first limit on current research into reading is methodological. At the
present time, there is no methodological warrant for generalizing from
laboratory research on what readers seem to know how to do to the behavior
of persons outside the laboratory. Happenings behind the eyes are hard to
see. Accordingly, analysts have had to set up well-specified and carefully
controlled environments in order to allow for carefully drawn inferences about
the information processing skills of their subjects. In being so careful,
experimentalists have effectively restricted their generalizing power to
conclusions about what it is their subjects know only under the conditions
the experimenter (most likely) controls (Scribner, 1975). New conditions
demand new experiments, ad infinitum. As such, there are no principled ways
of generalizing to the behavior of subjects in everyday life in which the
environments for their behavior are not so easily discovered by the analyst.
This fact leads to continual surprises for the laboratory oriented investigator,
as students of the social and cultural contexts for behavior keep producing
accounts of the apparent use of skills in everyday life on the parts of people
who do not demonstrate such skills on tests or experiments (Cole, Gay, Glick,
& Sharp, 1971; Cole, Sharp & Lave, 1976).

Although we have available many complaints that laboratory research is
often ecologically invalid and difficult to use as a basis for policy decisions
(Bronfenbrenner, 1977), it has only recently become clear that such research
can be systematically and inherently irrelevant to the everyday lives of its
subjects. The analytic power allowed by experimental procedures by virtue of
their careful control over only certain predefined dimensions of how people
organize their behavior renders experimental results nongeneralizable in the
face of everyday life scenes which are organized in more complex ways (Cole,
Hood & McDermott, 1978).

The second limit of research on skills readers or potential readers must
have in their heads is theoretical. It is not clear what role, if any, the
development of such skills has in the development of reading competence. It
is not even clear that such skills are hard to develop, and it is not fair to
assume that the problem with problem readers is that they are missing skills in
their heads. Although it is usually assumed that non-readers or poor readers
are missing skills which we must teach them with a bag-of-tricks pedagogy, it
has not been shown that the presence or absence of such skills is the key to
any person's reading problem.

If research is to produce relevant implications for instruction, it must speak
to the experiences of the people who have learned how to read or who are
learning how to read. Research into the cognitive skills underlying efficient
reading is assumed to capture something important about learning to read, but
it does not speak at all to the circumstances under which people either learn to
read or remain illiterate despite years of instruction. Our current
preoccupation with individual skills offers us a theoretical naivete in the face
of accounts of how displays of literacy skills appear to be arranged easily
under certain types of social organization.

For example, Soderbergh (1973) reports that she can consistently train two-
year-olds to perform some basic reading tasks. Conklin (1949) reports a
stunning case of an indigenous literacy system from the Philippines, in which
adolescents acquire literacy skills in the course of a few months. Reports of
this kind are not unusual, the most recent being that of Scribner and Cole
(1978a,b) who report on a delightful case from West Africa of a three-part
literacy (English, Arabic and Vai, with three different orthographies) for many
of the members of the Vai tribe.

These examples can be used to show that the acquisition of reading
competence, like the acquisition of any skill, takes some hard work and lots of
practice. But nowhere does it seem to be as hard to acquire as it is for so many
of our children in school. The point is that just about anybody can learn to do it
if they are organized to put in the time (Singer, 1977). Just about anybody can
learn how to read if the organizational formats available to them allow them to
spend a sufficient amount of time on task. We must give more attention to how
we might help to organize the circumstances for more reading to take place
(Singer & Beardsley, 1970; Smith, 1976). Then if we find that it is important to
have a better grasp of the psychological skills a child needs to develop in order
to learn to read, we can do so. At the same time, psychological theories of
reading will have to be expanded to account for the differential consequences
of particular learning environments.

A third reason for focusing reading research on how children do and do not
learn to read in school is political. Such research would allow us to unpack
what is apparently a key fact about the organization of differential levels of
reading competence in America, namely, that more members of some groups
rather than other groups achieve literacy, despite equal numbers of years in
school. This is a political football we could all do without, and it would be good
to understand how this fact is arranged. Then we can move beyond it to more
relevant concerns for individual children.

Traditionally, the approach to the question of differential school
performance by members of different social groups has been to look to the
characteristics of the different groups for an explanation. There are a number
of reasons for rejecting this approach. Whether the groups are described
along genetic, cultural, dispositional, or communicative grounds, there are
many reasons for describing these explanations, even in their social science
guise, as rooted in unexamined local prejudices (Cole, Gay, Glick & Sharp,
1971; D'Andrade, 1973; McDermott & Gospodinoff, in press). But there is an
interesting area of research here. If we look into the classroom contexts in
which these facts (race, culture, and communicative codes) become salient in
the organization of learning behavior, then we may discover also how
differential reading skills become salient points of identity and sources of
group borders in any given classroom.

Research into the organization of learning to read in one first grade 
classroom in a successful middle class school has revealed the following 
scenario (McDermott, 1976, 1977; McDermott & Aron, 1978). Children who 
entered school lacking reading skills were identified quickly by their teachers 
and handled differently. The organizational consequence of this differential 
handling was that they received only one-third the amount of time on reading 
tasks as the children classified as better readers. Thus, for every day that they 
were in the classroom, they fell further behind their peers in the development 
of reading skills. 
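
The arithmetic behind this widening gap can be sketched briefly. Only the one-third ratio comes from the study; the 30 minutes of daily reading time for the better readers and the 180-day school year are illustrative assumptions, not figures reported in the research cited above.

```python
# Illustrative sketch: cumulative difference in time on reading tasks.
# ASSUMPTIONS (hypothetical, not from the study): 30 minutes per day for
# the better readers and a 180-day school year.
# Only the one-third ratio is taken from the text.

TOP_GROUP_MINUTES_PER_DAY = 30   # hypothetical
TIME_RATIO = 1 / 3               # bottom group receives one-third the time
SCHOOL_DAYS = 180                # hypothetical


def cumulative_gap(days: int) -> float:
    """Minutes of reading time by which the bottom group trails after `days`."""
    top = TOP_GROUP_MINUTES_PER_DAY * days
    bottom = TOP_GROUP_MINUTES_PER_DAY * TIME_RATIO * days
    return top - bottom


for day in (1, 30, 90, SCHOOL_DAYS):
    print(f"day {day:3d}: gap = {cumulative_gap(day):.0f} minutes")
```

Under these assumptions the gap grows linearly with every day in the classroom, which is the sense in which the children "fell further behind" regardless of effort.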

This scenario was worked out despite the fact that the teacher spent an
equal amount of time with all the children, worried deeply about those who
were behind, and was one of the more successful teachers with such children.
The following fact points to the different experiences of children who were
behind. During their time at the reading table with their teacher, the children in
the bottom group were interrupted some 40 times as compared to the children
in the advanced group who were interrupted only twice. Now consider that
two-thirds of the interruptions came from the children from the advanced
group who entered the work space of the bottom group or from the teacher
who had to stop instruction in order to discipline children in other parts of the
classroom. At first, it appears to be a plot against the possibility of the children
who are behind ever catching up, and on the level of their learning to read, it
just might as well be. But it is not a plot. It is simply everyone trying to do their
best given the way the classrooms in America are organized. The problem with
our schools is not that children and teachers do not try hard, but that trying
hard does not help to solve the organizational dilemmas which flow from
pitting children against each other on the basis of their reading skills.

Children who come into school behind in the development of reading skills
are treated differently, and often unfairly, because they represent pedagogical
and interactional problems for their teacher. Most often, their lack of skills has
nothing to do with their ability to learn to read; often it means that their parents
have not taken the time to teach them to read at home, or that they are natives
to another language or culture. Or you can claim that they are less
developmentally advanced than the better readers, or, if you must, but I will
not join you, that they are not as innately smart as their peers. The reason for
their being behind at age six need not make a permanent difference. They can
still learn how to read at seven, eight or ten (Downing, 1973; Rohwer, 1971).
But once they get only one-third the amount of time on task as their peers, they
are in trouble, so much trouble that with time, they give up on being
unfavorably compared to their peers on reading and begin to develop other
identities such as class clown, bully, flirt, rebel (Labov & Robins, 1969;
Piestrup, 1973; Shuy, 1977).

Children who come in behind are a pedagogical problem, because they
need more of the teacher's time to catch up. They are an interactional
problem because being behind is an embarrassment which disrupts their
relations with the other children and the teacher. The question in American
classrooms is not whether or not everyone can learn to read, but how fast a
child can learn to read and how much better than others in the same class.
Identities and institutional biographies are on the line. Everyone is sorting
everyone else out into the successful and unsuccessful piles. To make
matters worse, everyone is doing this while making believe that this is not
what is happening. The handling of these identities and coverups takes time
and generates the organizational mishaps which keep the problem children
from spending time on task, from spending more time on becoming less of a
problem. Despite our ideology of equal opportunity for all, our classrooms are
organized around sorting children into those who can make it and those who
cannot. Children who bring any differences to the classroom are easy marks
for being sorted out. However, if the natural variation in culture, language,
interest and skill were not available, no doubt we would have to make some
even more arbitrary ways of distinguishing the good from the bad, the
successful from the problem, the smart from the dumb.

If this scenario has any relation to how schools try to develop reading skills
in children, then much of the research presently getting done on reading
represents a limited effort which keeps us from understanding the real
problem of the reading problems in our schools. The real problem is that our
schools appear to be organized, no matter how unconsciously, no matter how
much by hard working, caring and devoted people, to produce a certain
percentage of school failures among any new generation of school children.
This is a social organizational problem and demands a social organizational
solution. Neither research nor training in the skills underlying reading
competence will help us alter the social situation. Work on motivating better
readers will have to start with this problem.

The influence of a structured overview
on comprehension and oral reading miscues
of selected college students



The influence of prior knowledge on the meaningful reception, integration,
and retention of new concepts is a widely accepted precept in learning theory
(Ausubel, 1968; Smith, 1975). A group of reading professors asked to read a
passage on ocean engineering will display precise, yet meaningless word
calling (Goodman, 1977). This is hardly surprising, since the requisite
nonvisual information available to the practicing ocean engineer is simply
nonexistent for the reading professor.

An absence of prior knowledge about a particular subject, compounded by a
lack of teacher guidance in developing cognitive readiness, may impede a
learner's understanding of new concepts. This is not a unique or
earthshattering notion, yet many university and community college reading
improvement programs continue to "plug" students into pre-packaged
commercial kits and collections of articles with scant attention to cognitive
readiness (Karlin, 1977).

A number of studies have been conducted in content area classes to
examine the efficacy of pre-teaching concepts and key vocabulary with a
graphic advance organizer or structured overview (Vacca, 1977). A structured
overview presents the learner with a hierarchical map for pre-reading
discussion aimed at integrating his prior knowledge with new information.
Earle (1973) reviewed research in cognitive readiness and recommended that
structured overviews receive additional research attention in the content area
classroom.

The purpose of the two exploratory studies reported here was to examine 
the effect of a structured overview on the comprehension and oral reading 
miscues of students enrolled in a college reading improvement course. At this 
stage of investigation, the focus was on identifying trends in the data that 
might suggest areas for more detailed study with a population of students 
outside the content area classroom. 

Silent Reading Study

The first study examined the dependent variable of silent reading
comprehension under three conditions. Specifically, would there be any
significant difference in comprehension for students receiving pre-reading
instruction with a relevant structured overview and students receiving either a
placebo overview on an unrelated topic, or no pre-reading instruction?

Method

Eighty-five university students enrolled in a reading improvement course
participated in either the structured overview treatment or control conditions.
Students participated in the study according to their respective instructional
periods. Thus, students in period five received the structured overview in a
teacher guided pre-reading discussion (n = 28), period six, the placebo
(n = 27), and period seven, no pre-instruction (n = 30). Subjects in this
exploratory study were not randomly assigned to treatment or control
conditions, and individual reading achievement levels were not taken into
consideration.

Each of the three conditions involved silent reading of a Health Education
article (McPherson, 1972) from a collection of reading improvement
selections. The article was at the college level of difficulty according to the Fry
Graph (1968). Following the reading, students took a 15-item, researcher
constructed comprehension test highlighting cause-effect relationships from
the selection. No validity or reliability data were computed for this instrument.

Results

Three t-test comparisons were made to evaluate differences among the
three groups. The difference between the structured overview and control
group was statistically significant for silent reading comprehension, t(56) =
3.72, p < .05. A significant difference was also revealed between the
structured overview and placebo group, t(53) = 2.40, p < .05. And
comparison of the placebo and control group evidenced statistical
significance, t(55) = 2.35, p < .05. The means for the structured overview,
placebo, and control groups were 14.(5, 10.63, and 10.16 on the 15-item
comprehension test.
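
As a consistency check, the reported degrees of freedom can be recomputed from the group sizes given in the Method section. The sketch below assumes ordinary independent-samples t-tests with pooled variance (df = n1 + n2 - 2); the source does not state which form of the test was used.

```python
# Group sizes as reported in the Method section.
group_sizes = {"structured overview": 28, "placebo": 27, "control": 30}

# Degrees of freedom as reported in the Results.
reported_df = {
    ("structured overview", "control"): 56,
    ("structured overview", "placebo"): 53,
    ("placebo", "control"): 55,
}


def pooled_df(n1: int, n2: int) -> int:
    """Degrees of freedom for a pooled-variance independent-samples t-test."""
    return n1 + n2 - 2


for (group_a, group_b), df in reported_df.items():
    computed = pooled_df(group_sizes[group_a], group_sizes[group_b])
    status = "matches" if computed == df else "differs from"
    print(f"{group_a} vs {group_b}: computed df = {computed} ({status} reported {df})")
```

All three reported values agree with this assumption, and the group sizes sum to the 85 students stated in the Method section.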

Oral Reading Study

A related study was conducted to examine the effect of the structured
overview on the post oral reading comprehension and quality of miscues
generated by a sample of university students enrolled in the reading
improvement course.

An assumption implicit in the construction and use of the structured
overview is that pre-reading exposure to technical vocabulary, coupled with
teacher pronunciation of unfamiliar terms, will reduce the amount of
nonproductive encoding a student resorts to while reading (Vacca, 1977). If
this assumption holds, students should reveal fewer miscues on vocabulary
that was pre-taught with the structured overview. Thus, the following three
questions were examined:

(1) Would there be any statistically significant difference in the post oral
reading comprehension test scores of the structured overview and
placebo group?

(2) Would there be any significant difference in the quantity of miscues
generated on vocabulary pre-taught with the structured overview in
comparison to the placebo group?

(3) Would there be any significant difference in the quantity of
semantically unacceptable miscues generated by the structured
overview and placebo group?

Ten university students enrolled in a reading improvement course were
randomly assigned to either a structured overview condition (n = 5) or placebo
overview condition (n = 5). In both conditions students read the Health
Education passage orally after receiving pre-instruction with a relevant
structured overview or placebo overview. The oral reading was taped and
subscribed to the scenario for administration and analysis in the Reading
Miscue Inventory (Goodman & Burke, 1972). Following the oral readings,
students took a four-item, researcher constructed short essay test highlighting
cause-effect relationships from the selection. No validity or reliability data were
computed for this instrument.

Results

Three t-test comparisons were made, one for each of the questions under
consideration. The difference between the structured overview group and
placebo group on comprehension was not statistically significant, t(8) = 1.90,
p > .05, but did reach slight significance at p < .10. A difference in quantity of
miscues generated on vocabulary pre-taught was not detected, t(8) = -1.44,
p > .05; p > .10. The quantity of semantically unacceptable miscues did not
reach statistical significance for the two groups, t(8) = .867, p > .05; p > .10.
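
The pattern of significance reported above can be checked against tabled critical values. The sketch below assumes two-tailed tests (the source does not say whether the tests were one- or two-tailed) and uses the standard table values of Student's t for 8 degrees of freedom, which follow from the two groups of five (5 + 5 - 2 = 8).

```python
# Two-tailed critical values of Student's t for df = 8, from a standard t table.
# ASSUMPTION: two-tailed tests; the source does not specify.
CRITICAL_T_DF8 = {0.05: 2.306, 0.10: 1.860}

# t statistics as reported in the Results.
reported_t = {
    "comprehension": 1.90,
    "miscues on pre-taught vocabulary": -1.44,
    "semantically unacceptable miscues": 0.867,
}


def significant_at(t_value: float, alpha: float) -> bool:
    """True if |t| exceeds the two-tailed critical value for df = 8."""
    return abs(t_value) > CRITICAL_T_DF8[alpha]


for label, t_value in reported_t.items():
    flags = {alpha: significant_at(t_value, alpha) for alpha in CRITICAL_T_DF8}
    print(f"{label}: t = {t_value}, significant at {flags}")
```

Under this assumption only the comprehension comparison exceeds the .10 critical value, and none exceeds the .05 value, which agrees with the reported results.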

Discussion

The results of the first study seem to offer tentative support for the
structured overview as a means of guiding and increasing silent reading
comprehension in the reading improvement classroom. Even a placebo
overview seems to develop in students a mind set to approach a reading
selection with a predisposition for meaningful retention of major concepts and
subordinate ideas.

This exploratory study needs to be replicated with some modifications. That
is, subjects should be randomly assigned to treatment and control conditions.
Independent variables such as reading achievement, validity and reliability of
the comprehension test, and prose characteristics of the reading selection
might be considered. Other dependent variables such as delayed recall might
provide information on the degree to which a structured overview functions as
a mnemonic device.

The second exploratory study required subjects to read the selection orally,
an unnatural task at secondary level. This feature, in conjunction with the
small sample size, provided little information on the degree to which pre-
teaching vocabulary with a structured overview facilitates fluent reading.
Perhaps a more refined version of this study would produce more useful data.
The procedures used in this study would require a fairly large staff (e.g., 10) of
trained miscue examiners to collect the oral reading samples.

In summary, the first study reported here lends support to Earle's (1973)
contention that the structured overview is both a valuable means of providing
cognitive readiness, and a device warranting continued exploration in a variety
of classroom settings.

The effect of expanded directions
and adjunct aids on
students' comprehension of world
history text

There has been substantial research reported on the cognitive processing
of readers in prose learning situations. Experimental emphasis in prose
learning research, by and large, has been on what the reader does during the
learning episode. Wittrock (1963) indicates, "The result of guidance in
learning must be evaluated in terms of what the guidance causes the student
to do" (p. 74). Emphasis on what the reader does, according to Faw and Waller
(1976), has led to a "simple" methodology in the study of prose learning:
"manipulate the students' activities during acquisition by means of special
instructions, suggestions, or questions, and observe what effects these
variations have on learning and retention" (p. 692). The problem, then, has
been one of determining how experimental manipulation modifies cognitive
processing in a way that facilitates what will be learned.

Manipulation, in general, has either been preparatory in nature, i.e.,
advance organizers (Ausubel, 1963), structured overviews (Barron, 1971), and
directions (Wittrock, 1963), in order to modify the reader's set to learn, or else
has been guiding in nature, i.e., inserted questions (Frase, 1970, 1977), and
reading guides (Herber & Vacca, 1977), in order to modify the ongoing
processing behaviors of the reader. Frase (1971), in fact, has proposed a
heuristic model for research in prose learning in which he delineates the role
of adjunct aids to account for the utility of any class of verbal stimuli (advance
organizers, directions, questions, statements, etc.) in facilitating learning from
text. Adjunct aids may be thought of as "orienting directions" (Frase, 1970)
which dispose the reader to respond actively to certain aspects of text.

Questions doubtlessly form a subclass of adjunct aids which has received
considerable attention from reading researchers (Smith, 1977). Three
characteristics of questions appear to have facilitative effects on text learning:
1) the position of questions in text, 2) the contiguity of questions to related
content, and 3) the type of question asked. It is assumed by this investigator
that these characteristics hold across various classes of adjunct aids other
than questions.

Rickards (1975-76), for example, has examined the process effects of
advance organizers interspersed in text. He varied the type of organizing
statements (superordinate and coordinate) and their position either before
(prestatement) or after (poststatement) associated segments of text. Through
such experimental manipulation, Rickards determined that superordinate
prestatements (advance organizers) not only led to significantly greater
contiguous recall of subordinate facts but also generated recall of information
derivable from paragraphs incidental to the organizers themselves.

In the present study the investigator examined the facilitative effect created
by expanded directions and adjunct aids on students' comprehension of world
history text in tenth grade social studies. It was predicted that a preparatory
set to learn, induced through expanded directions (Barron, 1971), and adjunct
aids positioned en masse either before or after the text selection, or
interspersed individually within the text selection, would have a positive effect
on students' comprehension of world history content.

Method 

Subjects 

The students were drawn from the tenth grade of a parochial high school
located in a Northeastern city. Although an entrance test is required for
admittance, subjects represented a wide range of ability levels. The students,
on the whole, came from lower middle to upper middle socio-economic level
homes. Nearly all participants were Caucasian.

Ninety-seven pupils comprised the original sample. Only 74 students,
however, were included in the final data analysis. In order to have been
included in the final sample, students must have participated in every aspect
of the experimental treatment.

Two teachers participated in the experiment. One teacher was responsible
for three treatment groups and the other for two treatment groups.

Procedure

Five intact world history classes were assigned randomly to one of five
treatments. Below is a description of the treatment groups:

Experimental Group 1 (E1): Received expanded directions plus pre-exposure to
adjunct aids for each text selection

Experimental Group 2 (E2): Received expanded directions plus post-exposure to
adjunct aids for each text selection

Experimental Group 3 (E3): Received expanded directions plus adjunct aids
interspersed within each text selection

Experimental Group 4 (E4): Received expanded directions and was required to
read each text selection without any exposure to adjunct aids

Control Group 1 (C1): Required to read each text selection without exposure to
expanded directions or adjunct aids

Each of the experimental groups was exposed to a two day period of
expanded directions as specified by Barron (1971). Expanded directions were
operationalized to include: 1) the reading of the prepared handout on the
importance of perceiving thought-relationships in text and drawing
interpretations from them, 2) a review of the basic patterns of thought-
relationships (cause-effect, comparison-contrast, time order, enumeration)
and the "reading signals" which help identify these patterns, 3) completion of
exercises to provide students with practice in identifying the patterns as they
appear in sentences and paragraphs taken from the textbook, and 4) guided
explanation of the use of adjunct aids in each of their respective treatment
groups.

The adjunct aids for each text selection consisted of a set of
statements which represented interpretations of the text selection. A set of
statements was positioned en masse either before or after each selection of
text in groups E1 and E2 respectively. In group E3 each adjunct statement was
individually embedded immediately after that segment of the selection which
revealed the intratext relationship that led to an interpretation. If the students
in groups E1, E2 or E3 believed a statement to be a reasonable or "correct"
interpretation of the text, they checked the appropriate space to the left of
each statement. This procedure is recommended by Herber (1970) as a way of
maintaining the reader's active response to each statement.

Group E4 received expanded directions as operationalized above minus
step 4 (the explanation of how to respond to adjunct aids). The control group
received two study hall periods in place of expanded directions. All groups
were given 45 minutes to complete each text selection, with or without adjunct
aids.

Three sections from two chapters in a world history textbook were selected
for reading. The selections, "The Paris Peace Conference," "The Spread of
Democracy," and "The Growth of Nationalism," were chosen, essentially, for
their length (1000-1500 words) and their relevance to the unit topic, "Post
World War One Era." The investigator developed the set of adjunct aids which
accompanied each reading.

The experiment took six school days to complete. Monday and Tuesday of
the first week were devoted to expanded directions. Students were then given
successive text selections on Wednesday, Thursday and Friday. On Monday
of the second week, all groups were administered a 30-item multiple choice
test in which 45 percent of the questions tested knowledge, 45 percent tested
comprehension and 10 percent tested application as outlined in The
Taxonomy of Educational Objectives (Bloom, 1956).
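
The item counts implied by these percentages can be worked out directly. The sketch below only multiplies the reported figures; the observation that the percentages must have been rounded is an inference from the arithmetic, not something the source states.

```python
# Figures as reported: a 30-item test split 45% / 45% / 10%.
TOTAL_ITEMS = 30
reported_percentages = {"knowledge": 45, "comprehension": 45, "application": 10}

# Integer arithmetic before dividing avoids floating-point rounding noise.
implied_counts = {
    level: percent * TOTAL_ITEMS / 100
    for level, percent in reported_percentages.items()
}

for level, count in implied_counts.items():
    whole = "whole number" if count == int(count) else "not a whole number"
    print(f"{level}: {count} items ({whole})")
```

Because 45 percent of 30 is 13.5, the knowledge and comprehension figures cannot both be exact; they were presumably rounded from an uneven split of the 27 items not devoted to application.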

Design and Statistical Analysis 

The basic design of the study was the "Non-Equivalent Control Group
Design" (Campbell and Stanley, 1963), since the experimenter had only intact
classes at his disposal.

Analysis of covariance was used to analyze the data. The subtest, "Social
Studies Reading," from S.R.A. National Education Development Tests was the
covariate used to adjust scores on the criterion variable, the
world history content test. If the overall F (with alpha level set at .05)
indicated significant differences among treatments for comprehension of
social studies content, all pairwise comparisons of means were to be
evaluated by a post hoc technique developed by Tukey.

Results 

The hypothesis, cast in its null form for statistical analysis, stated that there
would be no differences in ability to comprehend world history text among
students in any of the five treatment groups. The analysis of covariance,
however, indicated a significant overall difference among the groups,
F(4,68) = 5.80, p < .05, thus rejecting the null hypothesis. The means for the five
groups on the 30-item comprehension test were 21.14, 18.72, 17.75, 16.60 and
16.5§. 
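
The degrees of freedom in the reported F ratio can be reconstructed from the design. The sketch below assumes a one-way analysis of covariance with a single covariate, which is what the Design and Statistical Analysis section describes; the function name is illustrative.

```python
def ancova_df(n_total: int, n_groups: int, n_covariates: int = 1) -> tuple[int, int]:
    """Return (between-groups df, error df) for a one-way ANCOVA.

    Each covariate costs one additional error degree of freedom
    compared with a plain one-way analysis of variance.
    """
    df_between = n_groups - 1
    df_error = n_total - n_groups - n_covariates
    return df_between, df_error


# 74 students in the final sample, five treatment groups, one covariate
# (the "Social Studies Reading" subtest).
print(ancova_df(n_total=74, n_groups=5, n_covariates=1))
```

With 74 students, five groups, and one covariate, this gives 4 and 68, which agrees with the reported F(4,68).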

Post hoc inspection of the data determined that students in E3 (expanded
directions combined with interspersed adjunct aids) comprehended the text
selections significantly better than E4 (expanded directions and reading-only)
and C1 (reading only).

Discussion 

The observed outcome of the study, that students receiving expanded
directions combined with interspersed adjunct aids (E3) performed
significantly better than either students receiving expanded directions and
reading-only (E4) or reading only (C1), may be attributed to several factors. With
respect to positioning adjunct aids in text, the study reaffirms a consistent
observation in prose learning research. That is, adjunct aids "work" best
when they occur immediately after relevant portions of text (Frase, 1970; Faw
& Waller, 1976). Whereas pre-questions function to cue responses to specific
stimuli, interspersed questions "placed after the related text reinforce
learning behaviors which are not limited to specific stimuli" (Frase, 1970, p.
345). Since the outcome measure for the study contained questions at the
knowledge, comprehension and application levels of Bloom's taxonomy, the
E3 treatment ostensibly exerted influence on cognitive processing beyond
intentional or statement-specific information and, indeed, had a facilitative
effect on total learning.

It should be noted, further, that the statements in experimental conditions
E1, E2 and E3 were superordinate in nature. As described by Rickards
(1975-76), a superordinate statement is defined as "a single sentence
representing a topic or overriding concept implied" in a text segment, i.e., an
interpretation. Thus the adjunct aids positioned immediately after relevant text
segments may have had a post organizer effect which facilitated the search
activities of the students in E3.

This study also suggests that provisions for a process set alone may be too
generalized to guide learning. Simply knowing that a particular behavior
should be used in a learning situation, and establishing its importance to
content acquisition, is not sufficient. Probably students who lack maturity as
learners require structured guidance in ongoing processing activities during
the reading act.

Effects of uninterrupted sustained silent reading
and of reading skills instruction
on changes in secondary students'
reading attitudes and achievement

Lyman Hunt (1970) introduced the concept of Uninterrupted Sustained
Silent Reading, or USSR. Proponents of USSR, such as Allington (1975), Ganz
and Theobold (1974), McCracken (1971), and Mork (1972), have strongly
recommended adoption of the program, which allows students to read self-
selected materials in school on a regularly scheduled basis. USSR programs
may also be referred to as SSR (Sustained Silent Reading), HIP (High Intensity
Practice), Reading Break, or Free Reading.

Research on the effectiveness of USSR is limited and inconsistent. On the
elementary level, USSR, in conjunction with some skills training, has been
found to be as effective in skills improvement as an equivalent amount of time
spent in skills-only instruction (Evans & Towner, 1975; Hanson, 1972; Oliver,
1970). Hanson also found that USSR had a positive effect on reading habits.
Wilmot (1975) found significant improvement in reading attitudes among USSR
groups of elementary students, but control groups with no USSR improved
significantly more in reading comprehension; no significant differences were
found in vocabulary growth. Petre (1971) provided several anecdotal reports of
successes in 50-75 Reading Break programs in Maryland elementary schools.

Little research has been done on the effects of USSR on older students.
Mikulecky and Wolf (1977) reported no significant differences among seventh
grade subjects experiencing USSR and two other treatments in required
reading classes. Attitude toward reading declined in all groups, although the
USSR group had the least decline. The overall decline was interpreted as part
of the general phenomenon of decline in attitude toward reading as more time
is spent in school (Bullen, 1972; Mikulecky, 1976).

Although USSR can be implemented in any classroom setting, at secondary
levels it seems most often attempted as part of required reading courses. This
study examined the effects of two typical treatments, Skills Instruction Alone
and Skills Instruction Plus USSR, on the reading achievement and reading
attitudes of seventh grade students in required nine-week developmental
reading classes. A single research question was investigated: Did students
who experienced 100% skills instruction differ in reading achievement and
attitude toward reading from students who experienced 80% skills instruction
and 20% USSR?

Method

Subjects 

The 125 subjects were white seventh grade students attending junior high
school in a suburban community near Milwaukee.

There were two treatments: Skills Instruction Alone and Skills Instruction
Plus USSR. Subjects experiencing Skills Instruction Alone worked for five
40-minute periods a week for nine weeks on reading skills units such as context
clues, word structure, and main idea. Subjects experiencing Skills Instruction
Plus USSR worked for four 40-minute periods a week on the same skills units
taught in the skills-alone treatment. However, for one 40-minute period each
week subjects read self-selected materials brought from home or chosen from
a classroom library.
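The 100% versus 80%/20% split named in the research question follows directly from this schedule. The short Python sketch below works through the arithmetic; the variable and function names are illustrative only and are not taken from the study.

```python
# Instructional minutes per treatment, from the schedule described above:
# five 40-minute periods a week for nine weeks.
PERIOD_MINUTES = 40
PERIODS_PER_WEEK = 5
WEEKS = 9


def minutes(periods_per_week: int) -> int:
    """Total minutes over the course for a given number of weekly periods."""
    return periods_per_week * PERIOD_MINUTES * WEEKS


total_minutes = minutes(PERIODS_PER_WEEK)   # whole course
skills_plus_minutes = minutes(4)            # skills work in the USSR treatment
ussr_minutes = minutes(1)                   # silent reading in the USSR treatment

skills_share = skills_plus_minutes / total_minutes
ussr_share = ussr_minutes / total_minutes

print(total_minutes, skills_plus_minutes, ussr_minutes)
print(f"skills {skills_share:.0%}, USSR {ussr_share:.0%}")
```

Under this schedule the USSR treatment gives up 360 of 1,800 course minutes to self-selected reading, which is the 20% figure used throughout the study.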

Treatments were conducted simultaneously for the duration of a nine-week
required developmental reading course. Two classes were randomly assigned
to Skills Instruction Alone and three classes to Skills Instruction Plus USSR.
Subjects were pretested with two measures of reading achievement, the
vocabulary and comprehension subtests of the Stanford Diagnostic Reading
Test (Level II, Form W), and with two measures of attitude toward reading, the
Estes scale (Estes, 1971) and the Mikulecky Behavioral Reading Attitude
Measure (MBRAM) (Mikulecky, 1976). Students were posttested with parallel
forms of the achievement measures and with the single forms of the attitude
measures.

Data Analysis

One-way univariate analyses of covariance were calculated for the
comprehension, vocabulary, and attitude posttest scores with the
corresponding pretest score as covariate.
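For readers unfamiliar with the technique, a one-way analysis of covariance with a single covariate can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' computation: the function name is hypothetical, the scores in `demo` are invented for the example, and homogeneity of regression slopes is assumed rather than tested.

```python
from statistics import fmean


def _cross(xs, ys):
    """Sum of cross-products of deviations from the means."""
    mx, my = fmean(xs), fmean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys))


def ancova_one_covariate(groups):
    """F test for a group effect on posttest scores, adjusting for pretest.

    `groups` maps a group label to a list of (pretest, posttest) pairs.
    Returns (F, df_effect, df_error).
    """
    all_pre = [pre for g in groups.values() for pre, _ in g]
    all_post = [post for g in groups.values() for _, post in g]
    n, k = len(all_pre), len(groups)

    # Residual sum of squares ignoring group (one common regression line).
    sse_reduced = (_cross(all_post, all_post)
                   - _cross(all_pre, all_post) ** 2 / _cross(all_pre, all_pre))

    # Residual sum of squares with separate intercepts and a pooled slope.
    wxx = wyy = wxy = 0.0
    for g in groups.values():
        pre = [p for p, _ in g]
        post = [q for _, q in g]
        wxx += _cross(pre, pre)
        wyy += _cross(post, post)
        wxy += _cross(pre, post)
    sse_full = wyy - wxy ** 2 / wxx

    df_effect, df_error = k - 1, n - k - 1
    f_value = ((sse_reduced - sse_full) / df_effect) / (sse_full / df_error)
    return f_value, df_effect, df_error


# Invented scores, for illustration only.
demo = {
    "skills_alone": [(48, 44), (50, 47), (45, 43), (52, 49), (47, 42)],
    "skills_plus_ussr": [(47, 46), (49, 49), (44, 43), (51, 51), (46, 44)],
}
f_value, df_effect, df_error = ancova_one_covariate(demo)
print(f_value, df_effect, df_error)
```

The error degrees of freedom equal the number of subjects minus the number of groups minus one for the covariate, so an F with 102 error degrees of freedom for two groups corresponds to 105 subjects with complete data on that measure.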

Results

Analysis of covariance for vocabulary showed no significant differences
between the two treatment groups, F (1,102) = .07, p > .05. Mean raw scores
for both groups declined from the pretest to the posttest. However, the decline
was less than one point for either group, which was well within the subtest's
standard error of measurement of 2.2 points. Analysis of covariance for
comprehension showed a significant difference between treatment groups, F
(1,100) = 4.61, p < .05. Mean raw scores declined for both the Skills
Instruction Alone group (-3.65 points from 48.72 to 45.07) and the Skills
Instruction Plus USSR group (-1.03 points from 47.97 to 46.94). While the
decline was significantly smaller in the group experiencing both skills
instruction and USSR, both declines were close to or within the subtest's
reported standard error of measurement of 3.0 points.
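The comparison of each decline with the subtest's standard error of measurement can be made explicit. The sketch below uses only the comprehension means and the 3.0-point standard error reported above; the dictionary and function names are illustrative.

```python
# Comprehension subtest: (pretest mean, posttest mean) as reported above.
COMPREHENSION_SEM = 3.0
comprehension_means = {
    "Skills Instruction Alone": (48.72, 45.07),
    "Skills Instruction Plus USSR": (47.97, 46.94),
}


def change(pre: float, post: float) -> float:
    """Posttest minus pretest, rounded to the precision of the report."""
    return round(post - pre, 2)


changes = {name: change(pre, post)
           for name, (pre, post) in comprehension_means.items()}
within_sem = {name: abs(delta) <= COMPREHENSION_SEM
              for name, delta in changes.items()}

for name in changes:
    print(name, changes[name], "within SEM" if within_sem[name] else "exceeds SEM")
```

The skills-alone decline slightly exceeds the standard error while the USSR-group decline falls inside it, which is what the phrase "close to or within" summarizes.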

Analysis of covariance for both the MBRAM and the Estes scale indicated
no significant differences in attitude toward reading between the two
treatment groups, F (1,104) = 2.55, p > .05 and F (1,101) = 1.67, p > .05.
Within each treatment group, mean raw scores declined from the pretest to
the posttest. For the MBRAM these nonsignificant declines were -.89 (60.13 to
59.24) for Skills Instruction Alone and -5.07 (60.77 to 55.70) for Skills
Instruction Plus USSR, and for the Estes scale -2.92 (71.62 to 68.70) and
-4.37 (74.25 to 69.88) respectively. There was a greater, but not significantly
greater, decline in attitude in the group experiencing USSR.

Lack of significant difference between treatment groups on the vocabulary
measure and a significantly smaller decline on the comprehension measure
for the Skills Instruction Plus USSR group indicate that 20% more time in skills
instruction did not benefit students' reading achievement. Also, one day a
week spent in USSR did not result in either a gain or a smaller decline in
attitude for the Skills Instruction Plus USSR group.

In both treatment groups, raw scores declined on all measures from pretest
to posttest. Declines on the achievement measures probably resulted from
standard error of measurement or from a ceiling effect, since better readers
had scored very well on the pretest with little room for growth on the posttest.
The declines are probably not educationally significant. Declines on the
reading attitude measures ranged from less than two points to a high of over
ten points. Only one class in either treatment group experienced a decline of
more than two points on both measures. This class had expressed great
dislike for the course. Attitude declines probably represent the typical
decrease in positive attitude experienced as students go through school or
resentment at being required to take the reading course.

This study suggests that short-term reading courses may have little impact
on reading achievement scores and may be detrimental to attitude toward
reading even when USSR is incorporated into the coursework. USSR need not,
however, be linked to a reading course, and research should be conducted on
its effects at different grade levels, over extended periods of time, and in
different instructional settings.
An investigation of
the REAP reading/study procedure:
its rationale and efficacy

The REAP procedure is a teaching-learning strategy designed to improve
reading, writing, and study skills. The focus of this study is upon advancing the
design of the strategy and assessing various aspects of its use as a
reading/study procedure within the setting of a college reading improvement
course.

The REAP Reading/Study Procedure has four basic steps. The student is
asked to:

R - Read to discover the writer's message.

E - Encode the message by putting it into his/her own language.

A - Annotate by cogently writing the message in notes for him/herself or in a
thought book to share with others.

P - Ponder; in other words, process the message, now in annotation form,
through internal "dialectic thinking" or through discussion with others.

Central to the REAP procedure is having students learn to write annotations
specially designed to achieve certain learning objectives (Manzo, 1973). In
writing such annotations, the reader is required to discriminate and synthesize
the ideas presented by the writer, translate these into his/her own language,
and crystallize the result in writing.

Seven annotation forms have been developed in working with the
procedure: summary, thesis, question, critical, heuristic, intention, and
motivation annotations. Some annotations are more suitable to some types of
writing than are others (Eanet & Manzo, 1976). The present study, with its
emphasis on study reading, utilized the first four most frequently.

The theoretical rationale for the REAP procedure finds a supportive base in
several related areas of research in reading and learning.

Gagne (1973), among others, has hypothesized that presenting a learner
with a specific learning task (for example, the charge to annotate) actuates a
"set" within him that determines the "strategies" with which he approaches
the material he is to read. He concludes that these strategies of search,
selecting, and coding in large measure determine what the learner will retain.

Supporting evidence for the existence of these strategies and their flexibility
is plentiful. Anderson and Biddle (1975), in their comprehensive review of
adjunct questions, find the explanation that such questions affect
performance "by influencing the processing activities of readers" most
persuasive. A number of prose learning studies give evidence that other task
or "set" variables can also determine what a reader remembers (e.g., Frase,
19..).

The use of an overt, constructed response as a productive learning task
finds some support in studies of adjunct questions (Anderson & Biddle, 1975)
and programmed learning (Williams, 1963, 1965), although in the studies
reviewed, the responses were not as complex as are the REAP annotation
forms. A further note of caution came from early investigations of study-
reading which involved précis or summary writing and failed to show support
for such an approach (Germane, 1921; Mathews, 1938; Arnold, 1942; Stordahl
& Christensen, 1956).

A major question arising from close examination of many related studies is 
whether the learning tasks and the strategies they induce actually reflect an 
increase in the quality of study or whether their effectiveness lies in their 
ability to increase quantity of study by lengthening the time spent inspecting
and reflecting upon the material to be learned. In the absence of persuasive 
evidence either way, inspection time was controlled in the present study. 

An additional area of interest that arose from a pilot study was how learning
the REAP procedure might interact with a student's cognitive style. Since it
appears to require a degree of reflection not typical of impulsive learners, it
was hypothesized that they might most benefit from its use.

This study, then, focuses upon the teaching and use of a reading/study
procedure in which the central feature is a learning task, viz., writing
annotations, presumed to have a positive effect upon the reader's learning
strategies and thus upon his reading effectiveness. It was conducted as a
teaching experiment in a natural setting and centered upon three major
issues: 1) the experimental subjects' degree of success in learning and using
the procedure, 2) how such learning interacted with the subjects' tendency
toward a reflective or impulsive cognitive style, and 3) how subjects trained in
the REAP procedure compared with control group subjects on several
measures of learning from specific prose passages.

Method 

Participants in the study were 105 students enrolled in a college reading and
study skills class at the University of Missouri-Kansas City. Each of the six
sections of the class was randomly assigned to one of three treatment
conditions. The experimental group received instruction and practice in the
use of the REAP procedure. The experimental-control group received
instruction and practice in using the SQ3R method and related skills
practice. The control group received instruction centered upon developing
"affective readiness" for reading/study skills improvement followed by
instruction and practice in study skills areas not directly related to study-
reading or comprehension improvement. Each group received four weeks of
instruction with the experimenter serving as instructor.

During the first week of the semester, all groups received the Nelson-Denny
Reading Test and the Reading Practices Mini-Inventory, an informal measure
of reading/study habits. From these measures, it was determined that the
groups did not have statistically significant differences in either basic reading
achievement or in reading/study habits.

Within the experimental group, measures were taken to determine the
students' progress in mastering and using the REAP procedure. These
students were also administered the adult version of the Matching Familiar
Figures Test, as used by Yando and Kagan (1968), to determine their
tendencies toward impulsive or reflective behavior. This cognitive style
variable was examined for its relationship to the use and learning of the
procedure.

A two by three factorial design with repeated measures on the first factor
was employed to examine the experimental subjects' ability to write
annotations before and after instruction across the three levels of the
cognitive style dimension: impulsive, undetermined, and reflective.

During the sixth week, six posttest measures were administered. Two
measures involved written recall of significant ideas in an informative article
1) immediately following a fifteen minute reading/study period and 2) forty-eight
hours later. Two objective recognition tests, based upon the same selection,
were administered following the delayed recall test. Then, a short
philosophical essay was used to elicit two written reflective responses from
each subject. All written responses were coded, randomized, typed, and
judged against previously established criteria.

Finally, all students responded anonymously to a self-assessment
questionnaire designed to determine how useful they perceived their
instruction to be and how frequently they were using what they had been
taught.

A posttest-only control group design was used for the comparative portion
of the study.

Results

A two-way analysis of variance for repeated measures was performed on
the annotation writing pretest and posttest scores with the cognitive style
groups serving as the second factor. Scores on the post measure were
significantly higher than those on the pretest, F (1, 29) = 21.42, p < .001, with
the students obtaining a mean score of 17.63 on the pretest and a mean score
of 23.69 on the posttest.

On the second factor, the cognitive style variable, no statistically significant
difference was revealed, nor was there a significant interaction. However, an
examination of the mean scores for each cognitive style group revealed that
the impulsives showed the most improvement in annotation writing with a
pretest to posttest difference score of 6.7, while the reflective subjects
showed only a 3.9 point improvement.

Data from the six measures designed to compare the experimental (REAP)
subjects to the experimental-control (SQ3R) and control subjects in learning
in specific reading/study situations were subjected to a one-way analysis of
covariance with the Nelson-Denny Reading Test total scores as covariate. The
only statistically significant difference was found on the test of immediate
recall. A Scheffe multiple comparison procedure applied to the adjusted
means determined that the difference lay between the control and the
experimental-control groups. Examination of the mean scores on all six
measures revealed that, in general, the control group made the highest
scores, followed by the experimental (REAP) subjects, and then the
experimental-control (SQ3R) subjects.

No significant differences were found among groups on the student self-
assessment questionnaire. All groups tended to "Agree Somewhat" that the
instruction they received had been helpful. Further, they reported using what
they had been taught either "sometimes" or "frequently."

Conclusions and Discussion 

College reading improvement students were successful in learning to write
the annotation forms central to the REAP procedure. Further, the experimental
subjects, as indicated by self-report, found the instruction in the REAP
procedure helpful and tended to incorporate it into their study as frequently as
their peers did more traditional instruction.

Further, based on an examination of the mean scores, there was some 
indication that students who demonstrate an impulsive cognitive style may 
profit most from instruction in the procedure, perhaps because it requires a 
degree of reflection that they do not typically bring to reading tasks. 

However, this study fails to provide evidence that the learning and use of
the procedure helped students achieve more effective learning from prose
when compared to students who received other training and when
reading/study time was controlled. Several possible explanations for these
results must be considered. First, it is possible that the learning tasks implicit
within the REAP procedure are counterproductive. Secondly, the higher mean
scores obtained by the control group may indicate that students had not
integrated either REAP or SQ3R into their own reading habits thoroughly
enough to compete with other students who studied in their habitual manner.
Finally, the time controls may have been too restrictive; the perceived
helpfulness of the procedure may lie, not in any tendency to increase quality of
study, but rather in increasing inspection and reflection time.
The effect of
creative thinking-reading activities (CT-RA)
on reading comprehension

This study was designed to examine the effect of training in creative
thinking on reading achievement. Central to its conception were the following
considerations: 1) "creative reading," the acknowledged
reading achievement (Smith, 1969), is apparently more talked about than
taught; 2) attempts to teach creative reading might be strengthened through
development of systematic teaching procedures; and 3) the Languaging in the
Content Areas (LICA) thesis (Manzo & Sherk, 1978) provides a framework for
designing procedures which combine creative thinking with reading-language

"Creative process" is one of the three major notions embodied by the LICA
thesis (Manzo & Sherk, 1978). This process, whether called "creative
thinking," "divergent thinking," or "creativity," is often perceived as
belonging only to those who produce great artistic or literary works. As defined
by LICA, however, "creative process," in combination with language learning
and dialectical thinking, is a way of looking at the world, i.e., an approach to
living rather than a specific ability or talent. Access to creative process is
available to all through instruction, training, and practice.

Within the context of reading, creative process can be viewed as an
interaction, or dialogue, between reader and text. Just as the creative thinker
actively responds to the environment, so too does the creative reader respond
to the printed page. This response, according to LICA, takes the form of
"dialectical thinking," whereby the reader converses with him/herself,
rearranging, adding, and reorganizing information based upon both prior

The Creative Thinking-Reading Activities (CT-RA) used in this study are
designed to stimulate dialectical thinking through the inclusion of language
interaction as suggested by the LICA thesis. Each activity requires that
students utilize creative thinking abilities, e.g., improving products, identifying
problems, seeking new solutions, etc., then orally retrace their thinking to
determine how they arrived at their conclusions, and examine alternative
possibilities and/or lines of reasoning. The critical features are
that 1) they require both divergent and convergent thinking abilities, 2) they
require articulation of ideas in the form of a dialogue between teacher and
student, and 3) they are engaged in prior to reading instruction so that thought
processes used in the activity may be applied to the language-reading task. An
example of this type of activity is:

CT-RA. Ask the student to list all of the ways she/he can think of to
improve school desks. After the list is completed, discuss each
suggestion in terms of how it will make the desk better, how easy or
difficult it would be to add, and how important the change would be.
Work with the student to assess the likelihood of actually effecting
the changes: what would have to be done? what are the major
obstacles? what are the stages for development of changes?
should it actually be tried?



Reading-Language Application. After reading a story, choose an
important object (significant to the story) and ask the student to list
all the ways he/she can think of to improve the object. Enter into a
dialogue concerning how these changes might affect the
characters, the story outcome, etc.

Method

Sample

The Subjects for this study were 33 students enrolled in the 197ty summer 
reading clinic at the University of Missouri-Kansas City. The students ranged
in age from seven to eighteen, and displayed a wide range of reading
difficulties.

Procedure

Students received individual instruction from Master's degree candidates
in reading education. The students and their tutors were randomly assigned to
one of three treatment groups. Students in the experimental group (n = 12)
participated in Creative Thinking-Reading Activities, along with tutor-
determined remedial instruction. Students in the experimental-control group
(n = 10) participated in games which are more commonly used in
reading/remedial instruction, along with tutor-determined remedial instruction.
Students in the control group (n = 11) received regular instruction which took
the form of tutor-determined procedures for remediation of specific reading
problems. The number of students enrolled for the summer clinic fell far below
expectancies, thereby sharply reducing anticipated sample sizes. Students
and tutors were blind to experimental conditions.

The study was conducted over a period of five weeks. Three days of the first
and last weeks were used for testing. During the experimental period, tutors
kept daily records of the activities used, along with their own and student
reaction to them. The experimenter attended all sessions to observe
instructional procedures and to answer any questions which might arise.
Data were collected to answer the following questions:

1. Are there significant differences between the three groups with
respect to reading achievement? Are these differences evident when
initial achievement levels and IQ are held constant?

2. Are there significant differences between the three groups with
respect to creative thinking behavior? Are these differences evident
when age and IQ are held constant?

Measures

Individual intelligence test scores (WISC or WAIS) were obtained along with
age, sex, and grade data.

The Classroom Reading Inventory (CRI) (Silvaroli, 1973) was used as a 
measure of reading comprehension. Tutors administered alternate forms of 
this published informal reading inventory during the first and last weeks of the
study. 

The Torrance Tests of Creative Thinking, Form A (Torrance, 1966) were 
used as a posttest-only measure of creativity.
Results

Reading Achievement

Since the CRI test yields scores from readiness through eighth grade, it was
necessary to transform grade-level scores into assigned scores in order to
quantify readiness, pre-primer, and primer levels. The highest grade-level
score in the instructional range was used for analysis.

To examine differences which existed between the groups with respect to
initial reading achievement and IQ, separate analyses of variance were
performed on pretest scores and IQ scores. Mean reading pretest scores of
4.7, 6.4, and 4.5 were obtained by the experimental, experimental-control, and
control groups, respectively. IQ mean scores were 99.6, 102.6, and 99.2. The
F ratios obtained were not statistically significant.

Using a one-way analysis of covariance, Classroom Reading Inventory
posttest scores were adjusted for the effects of initial reading achievement
and IQ. Adjusted mean posttest scores for the groups were 6.4, experimental;
5.4, experimental-control; and 6.0, control. A statistically significant treatment
effect was found, F (2,30) = 3.36, p < .05. The Scheffe
procedure was applied to the adjusted mean scores and yielded a statistically
significant difference between the experimental and experimental-control
groups, F (2,30) = 3.53, p < .05. No other statistically significant differences
were found.

Separate analyses of variance were performed on the verbal, figural, and
total raw scores from the Torrance Tests of Creative Thinking. No statistically
significant differences were found on any of these analyses. Analyses of
covariance were then performed, with age and IQ as covariates. Again, no
statistically significant differences were found.

Discussion

The data partially support the hypothesis that training in creative thinking
is useful as a means for increasing reading comprehension. Findings from
analysis of the Classroom Reading Inventory suggest that students trained in
creative thinking benefit in their reading performance from such training.
Some support was found for the hypothesis that systematic training in creative
thinking would yield differences between the three groups with respect to
creativity.

In general, it seems that the Creative Thinking-Reading Activities, which
were designed to act as a warm-up, or catalyst, to creative reading, did
increase reading performance. The CT-RA were engaged in prior to the
reading lesson itself. They were linked to the reading activity through a
process which allowed the student and teacher to review thinking strategies
and recapture them while engaged in story-related tasks. Through such a
procedure, the reader was then equipped and willing to add to what was being
read. This conclusion is strengthened somewhat by the fact that the subjects
in this study were acknowledged reading "failures" who exhibited a host of the
myriad psychological, emotional, and educational problems associated with
reading disability. That they would exhibit some measure of reading success
with the CT-RA serves to emphasize the potency of such an approach.
The effects of instruction in testwiseness
in a college reading improvement course 

Testing represents the major portion of a college student's evaluation and
is often the main criterion for admission into graduate and professional schools.
The average college student will take over 100 tests during his college career
in addition to special examinations such as the Graduate Record Examination,
Law School Admissions Test, and others. Knowing how to take tests is a skill
that is essential to a successful college career. As college students become more
and more concerned about their ability to take tests, they are turning to
reading and study skills courses for guidance and instruction in test-taking.

Several investigators have examined the effects of teaching testwiseness. 
Populations have ranged from pre-schoolers to adults. The following
conclusions may be drawn from the studies on teaching testwiseness:

1. Instruction in testwiseness is effective when its success is judged on the
basis of an internal criterion, usually a project-developed scale of testwiseness
(Gibb, 1964; Moore, Schutz, & Baker, 1966; Wahlstrom & Boersma, 1968;
Slakter, 1969; Slakter, et al., 1970; Moore, 1971; Pencheff, 1972; Woodley, 1972;
Langer, et al., 1973; and Yearby, 1974).

2. Although significant gains in testwiseness are usually found on internal
criterion measures as a result of instruction, they frequently do not transfer to
external criterion measures (Tinney, 1968; Woodley, 1972; Yearby, 1974; Kahn,
1976; Jongsma, 1975; Keysor & Williams, 1977). Some investigators, however,
have found significant differences on standardized achievement and aptitude
tests favoring examinees who have received instruction in testwiseness
(Oakland, 1972; Callenbach, 1973; Sause & Grieco, 1973; Gaines & Jongsma,
1974; Eakins, Green, & Bushell, 1976).

3. The long-term effects of instruction in testwiseness have yet to be
demonstrated (Callenbach, 1973; Oakland, 1972).

4. There is not enough evidence to suggest that instruction in testwiseness
is more beneficial for particular subgroups of students, e.g., disadvantaged
students, high ability readers, etc. (Tinney, 1968; Yearby, 1974; Jongsma, 1975).

5. No particular method of instruction (lecture, programmed texts, group
discussion) is superior in teaching testwiseness (Langer, et al., 1973; Pencheff,
1972).

The purpose of this project was to investigate the effects of teaching
selected aspects of testwiseness to students enrolled in a reading improvement
course. The decision was made to limit the scope of the project to multiple-
choice items. The study sought to investigate the following research questions:

1. Do college students demonstrate testwiseness in answering multiple-
choice items?

2. Can college students achieve significantly higher scores on a multiple-
choice criterion measure of testwiseness following instruction in test-taking
skills?

3. What is the effect of test-taking instruction on particular testwiseness
strategies?

4. Is there a relationship between reading skills (vocabulary, comprehension,
and reading rate) and testwiseness?



Method 

Sample 

Subjects used in this study were 58 students enrolled in a noncredit college
reading improvement course during the Spring semester, 1977. They ranged 
from freshmen through graduate levels. Most of the students were majoring 
in the social sciences and humanities. 

A pretest-posttest control group procedure was used for the study. Because
the investigators were limited to using intact classes, classes, rather than
individuals, were assigned to treatment groups. The investigators felt justified
in using this procedure because (1) there was no reason to believe that
classes differed significantly in composition or ability, and (2) the Nelson-Denny
Reading Test scores would be used as a covariate to equate for differences in
reading ability, if they existed. Both the control group and experimental group
consisted of 29 students each.

Procedure 

A self-instructional booklet was developed for teaching the selected aspects
of testwiseness. The following six strategies were selected for inclusion because
of their frequent mention in the literature: (1) eliminate incorrect options;
(2) look for repetition or redundancy between stem and options; (3) eliminate
similar options; (4) eliminate options which include specific determiners;
(5) look for longer, more precise options; and (6) be alert to grammatical
inconsistencies.

The general pattern that was used in writing the booklet was to introduce
and explain each strategy using an illustrative item and then provide two
additional items, with correct feedback, for practice in applying the strategy.
After working through all six strategies, students took a six-item review test for
added practice in applying the strategies.

The investigators developed a test that could be used to assess students' 
entering level of testwiseness and that could also be used to assess the effects 
of instruction in testwiseness. Essentially, the test was to contain items with 
construction flaws that could be answered more readily by students who were
testwise.

A small core of items selected from Millman and Pauk (1969), Ford (1973),
and Slakter (1970) was used as a base. The investigators wrote additional
items forming an item pool of 60 items, consisting of ten items for each strategy.
A panel of judges reviewed the items and identified the five which best
represented each strategy. The end product was a 30 item test with five items
for each strategy. The items were randomly ordered for the pretest so that all the
items pertaining to a particular strategy did not appear sequentially. The same
30 items were randomly ordered a second time to form the posttest.
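The assembly of the two test forms can be sketched as follows. This is an illustration only: the strategy labels, the item numbering, the judges' selections, and the random seeds are all hypothetical stand-ins, since the source does not list the actual items or describe how the random orders were generated.

```python
import random

STRATEGIES = [
    "elimination", "repetition", "similar_options",
    "specific_determiners", "length", "grammar",
]
POOL_PER_STRATEGY = 10   # ten pooled items per strategy, 60 in all
KEPT_PER_STRATEGY = 5    # judges keep the five best for each strategy

# Hypothetical judges' picks: here simply the first five item numbers.
selected = {s: list(range(1, KEPT_PER_STRATEGY + 1)) for s in STRATEGIES}


def build_form(selection, seed):
    """Return the selected items as (strategy, item_number) in random order."""
    items = [(s, i) for s in STRATEGIES for i in selection[s]]
    random.Random(seed).shuffle(items)
    return items


pretest_form = build_form(selected, seed=1)    # seeds are arbitrary
posttest_form = build_form(selected, seed=2)   # same items, second ordering

print(len(pretest_form), pretest_form[:3])
```

Because both forms draw on the same 30 items, any pretest-posttest difference reflects a change in performance on identical content rather than a change in item difficulty, although it leaves open the possibility of a practice effect.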



Results 



The results of the data analyses have been organized around the four major
research questions which guided this study. 

Research Question 1

As an indicator of demonstrated testwiseness, multiple t-tests were performed
between the Total Score mean and the mean Total Chance Score and between
the individual subscale means and corresponding subscale chance scores on the
pretest. The results of the multiple t-tests indicated that all subscale means and
the Total Score mean differed significantly from the chance scores with the
exception of the subscale for Grammar.
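
A comparison of an observed mean with a chance score can be sketched as follows. This is a hedged illustration, not the investigators' procedure: the report does not state how many options each item had, so the four-option format is an assumption, as is the sample size of 58; the function names are hypothetical.

```python
import math

def chance_score(n_items, n_options):
    """Expected number correct from blind guessing."""
    return n_items / n_options

def one_sample_t(mean, sd, n, mu):
    """t statistic for a sample mean tested against a fixed value mu."""
    standard_error = sd / math.sqrt(n)
    return (mean - mu) / standard_error

# Assumption: four options per item (not stated in the report).
total_chance = chance_score(n_items=30, n_options=4)
subscale_chance = chance_score(n_items=5, n_options=4)

# Combined pretest mean and SD as reported; n = 58 is an assumption.
t_total = one_sample_t(mean=16.862, sd=4.31, n=58, mu=total_chance)
```

Under these assumptions the total chance score is 7.5 items and each five-item subscale has a chance score of 1.25 items; a different number of options would change both values.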

Research Question 2

An analysis of covariance was conducted, using the Nelson-Denny as the
covariate, to test the difference between total posttest means. The results
indicated a significant difference between the posttest means for the
experimental and control groups, F(2,55) = 54.79, p < .01. The experimental group
results were: pretest M = 16.586 (SD = 3.33) and posttest M = 24.896
(SD = 2.61). The control group results were: pretest M = 17.138 (SD = 5.15)
and posttest M = 19.310 (SD = 4.34). The combined results were: pretest
M = 16.862 (SD = 4.31) and posttest M = 22.104 (SD = 4.53).

Research Question 3

Table 1 presents an experimental and control group comparison of mean
percentage of students responding correctly, and gains or losses for individual
subscales. The experimental group demonstrated a significant increase over
the control group in all six testwiseness strategies.



Table 1

Mean Percentage of Students Responding Correctly to
Individual Testwiseness Subscales

                             Pre             Post          Gain or Loss
                         Exp      C      Exp      C       Exp        C
Elimination              66.0    75.0    77.0    77.4    11.0*      2.4
Repetition               59.0    66.0    82.0    69.0    23.0**     3.0
Similar Options          63.0    64.0    87.0    70.0    24.0**     5.4*
Specific Determiners     54.0    62.0    93.0    67.0    39.0**     5.2
Length                   54.0    52.0    86.0    54.0    32.0**     2.0
Grammar                  35.0    39.0    72.0    49.0    37.0**     9.6*

 * p < .05
** p < .01



Research Question 4

The means, standard deviations, and intercorrelations for the four Nelson-Denny
reading scores and the testwiseness pretest scores for combined groups
were computed. With the exception of Vocabulary, there was no significant
relationship between reading skills and testwiseness.



Discussion



College students, at least as represented by students used in this study, do 
possess a significant degree of testwiseness in responding to multiple-choice 
items. It appears that students were most effective in applying the Elimination 
strategy, followed by Similar Options, Repetition, Specific Determiners, Length, 
and Grammar.

Apparently, even though college students possess some degree of testwiseness,
they can significantly improve their ability to apply testwiseness through
instruction in test-taking. There was a significant gain for each testwiseness
strategy. The greatest gain was found for Specific Determiners with the strategy
of Grammar close behind. Elimination produced the least gain, although
significant at the .05 level. The investigators believe that instruction had least
effect on Elimination because, unlike the other strategies, this strategy involves
some knowledge of content. That is, to apply the process of elimination, a
test-taker must be able to infer which options are absurd or incorrect. The
instructional booklet did not teach the content of the items. On the other hand,
the strategies of Specific Determiners and Grammar are most susceptible to
superficial characteristics that are independent of content.
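
The ordering of gains described above follows directly from the experimental-group percentages in Table 1. A minimal sketch of the arithmetic, with hypothetical helper names `gains` and `ranked`:

```python
# Pretest and posttest percentages for the experimental group (Table 1).
experimental = {
    "Elimination": (66.0, 77.0),
    "Repetition": (59.0, 82.0),
    "Similar Options": (63.0, 87.0),
    "Specific Determiners": (54.0, 93.0),
    "Length": (54.0, 86.0),
    "Grammar": (35.0, 72.0),
}

def gains(table):
    """Gain in percentage points (posttest minus pretest) per subscale."""
    return {name: post - pre for name, (pre, post) in table.items()}

def ranked(table):
    """Subscale names ordered from largest to smallest gain."""
    gain_by_name = gains(table)
    return sorted(gain_by_name, key=gain_by_name.get, reverse=True)

experimental_gains = gains(experimental)
gain_order = ranked(experimental)
```

The computed gains reproduce the Gain or Loss column for the experimental group, with Specific Determiners first and Elimination last.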

There appears to be little relationship between reading ability and testwiseness.
The significant relationship between Reading Vocabulary and testwiseness may
be due to an underlying relationship between verbal intelligence and
testwiseness. Of course, this study cannot clearly confirm this hypothesis, but it
would be worthy of further investigation.



A comparison of the effectiveness of
three study techniques for college students



The recent increase of reading-study skills programs at the college level has
led to a reexamination of the study skills literature in an attempt to ascertain
the effectiveness of various study techniques. Gates (1917) demonstrated that
recall of prose and non-prose material was increased when, after reading,
subjects attempted to recite (review mentally) everything read. Further
investigation into the effectiveness of the recitation-study technique met with
mixed results. Peterson (1944) failed to find a significant difference between
recitation and read-reread groups. More recently, however, Del Giorno,
Jenkins and Bausell (1974) adapted the technique to a 455-word prose
selection with recitation occurring after each paragraph. They found that in
both immediate and delayed recall, subjects who recited after reading
recalled significantly more of the passage than those students who read and
then reread the passage.

Another attempt to investigate the effectiveness of study techniques has
involved assessment of notetaking. A comparative study of the effectiveness
of underlining and notetaking while reading conducted by Hoon (1974)
resulted in no significant differences in immediate or delayed recall. However,
Kulhavy, Dyer and Silver (1975) found superior recall with students who
employed notetaking while reading as opposed to students who underlined the
text or studied the text. Researchers DiVesta and Gray (1972), and Fisher and
Harris (1974) have proposed that notetaking provides an opportunity for the
learner to interact with the material, which facilitates encoding. However, there
may be no need for a student to overtly record his/her own ideas for this
interaction to occur. Perhaps by providing the opportunity for students to
mentally review the contents of the passage, as in the Gates recitation
technique, the same encoding function would be served.

The purpose of this study was to assess the effectiveness of three study
techniques as measured by a recall test of an 800-word passage on both
immediate and delayed tests. The three study techniques were: 1) reading and
then rereading the passage; 2) reading the passage with periodic mental
review; and 3) reading the passage with notetaking.

Method

Subjects

The subjects were university juniors and seniors enrolled in an elementary
education methods course at The Pennsylvania State University who
volunteered to participate in the study. Forty-eight students were randomly
assigned to one of the three treatment groups.

Materials

The stimulus material used in each of the treatment groups was an 800-word
passage about vitamins developed by Sagaria and DiVesta (1977). The
passage was divided into ten paragraphs of seven sentences with each
sentence containing one idea. A recall item was constructed for each
sentence. The resulting 70 questions thus represented all the factual concepts
presented in the passage. Through the use of a computer processing program,
40 questions were randomly selected from the domain of the 70 items to make
up a separate test for each subject. Each set of 40 items equally sampled the
information from the ten paragraphs. Once selected, the order of the
questions in each recall test was randomized to reduce the likelihood that an
item could be answered because it duplicated the sequence of the material
within the passage.
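
The item selection described above amounts to a stratified random sample followed by a shuffle. The sketch below is an assumption about how such a program might work, not a reconstruction of the original one: the function name, the (paragraph, sentence) labels, and the reading of "equally sampled" as four items per paragraph are all hypothetical.

```python
import random

def build_recall_test(n_paragraphs=10, sentences_per_paragraph=7,
                      items_per_paragraph=4, seed=None):
    """Draw an equal number of items from each paragraph, then shuffle."""
    rng = random.Random(seed)
    selected = []
    for paragraph in range(1, n_paragraphs + 1):
        sentences = range(1, sentences_per_paragraph + 1)
        # Sample without replacement within the paragraph.
        for sentence in rng.sample(sentences, items_per_paragraph):
            selected.append((paragraph, sentence))
    # Randomize item order so it does not follow the passage sequence.
    rng.shuffle(selected)
    return selected

recall_test = build_recall_test(seed=7)
```

Calling the function with a different seed for each subject yields a separate 40-item test per subject, each drawing four of the seven items from every paragraph.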

Procedure

The data for the study were collected during two sessions. During each
session students met in small groups of four to eight. At the beginning of the
first session the stimulus material was disseminated and students were
advised that upon completion of the study task they would be administered a
40-item recall test. They were also informed that in one week's time they
would return for a similar test on the same material.



Oral instructions on the study technique to be used were given to each
treatment group. Students in the read-reread group were instructed to read
through the material as if they were reading an article, not stopping to mull
over ideas. Upon completing the passage they were instructed to reread it in
the same manner. Students in the read-mental review group were instructed to
read the passage one paragraph at a time. After reading each paragraph, they
were to look away from the passage, attempt to recall the information in the
paragraph from memory, and then look back at the paragraph to check
whether they had missed any important information. They were instructed to continue
this process until the paragraph was mastered. Students in the read-notetake
group were instructed to follow the same basic procedure as the read-mental
review group. However, they were to write the information recalled after each
paragraph.

All students were required to record the time they used to read/study the
passage from a digital clock at the front of the room. Upon finishing the
reading/studying, each student was given his/her 40-item short answer test.
During the second session, one week later, the students were administered a
different set of 40 items randomly selected from the domain of 70 items.

Analysis

In order to assess the performance of the three study groups on the
immediate and delayed recall tests, a 3 x 2 analysis of variance (study
condition by test condition) was computed. In addition, the study time factor
(minutes used to study) was analyzed with a one-way analysis of variance
with the three treatment groups serving as a between-subjects factor. Level of
significance was set at the .01 level, and follow-up pairwise comparisons were
made via the WSD procedure.
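
For readers unfamiliar with the one-way analysis of variance applied to study time, the F ratio can be computed as sketched below. The function is a generic textbook formulation and the scores are invented for illustration; neither comes from the study's data.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of score lists."""
    all_scores = [score for group in groups for score in group]
    grand_mean = sum(all_scores) / len(all_scores)
    group_means = [sum(group) / len(group) for group in groups]

    # Variation of group means around the grand mean.
    ss_between = sum(
        len(group) * (mean - grand_mean) ** 2
        for group, mean in zip(groups, group_means)
    )
    # Variation of scores around their own group mean.
    ss_within = sum(
        (score - mean) ** 2
        for group, mean in zip(groups, group_means)
        for score in group
    )
    df_between = len(groups) - 1
    df_within = len(all_scores) - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within

# Invented scores, three groups of three, purely to exercise the function.
f_ratio, df_between, df_within = one_way_anova(
    [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
)
```

With three groups and 48 subjects the degrees of freedom would be 2 and 45, which matches the values reported for the study time analysis.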



Results

The study-condition by test-condition analysis of variance revealed
significant main effects for both study condition, F(2,45) = 6/4, p < .01, and
test condition, F(1,45) = 143.17, p < .01, and a significant interaction, F(2,45)
= 8.46, p < .01. In the immediate test condition the follow-up comparison of
study condition effects using the WSD procedure indicated that there was no
significant difference between the performance of the read-notetake and
read-mental review groups (mean number of correct responses = 29.9 and 29.0).
However, both were significantly different from the read-reread group (21.0). In
the delayed test condition, although the same basic trend in means occurred
among the read-notetake, read-mental review, and read-reread groups (20.0,
20.6 and 17.6), there were no significant differences among any of the groups.

The one-way analysis of variance of the study time factor revealed an
overall significant main effect, F(2,45) = 61.08, p < .01. The follow-up
comparison using the WSD procedure indicated that the read-notetake,
read-mental review, and the read-reread groups (mean study time in minutes =
26.92, 17.38 and 8.75 respectively) were each significantly different from one
another.

Discussion

The results indicated that on an immediate recall test, the study techniques
of notetaking and mentally reviewing produced significantly superior results to
the rereading technique. In the delayed test condition, although the trend in
means was the same, there were no significant differences. Significant study
time differences existed among all three groups, with notetakers using the most
time and rereaders the least.

For immediate recall it appears that both notetaking and mentally reviewing
the passage are effective means of studying. However, it is inappropriate to
ascertain effectiveness without attention to study time. If it were assumed that
difference in study time could account for difference in the amount of
information recalled, then a direct relationship between the amount of
information recalled and the amount of study time should exist. However, this
relationship is not present. Although the notetakers spent significantly more
time studying, their immediate recall was not significantly different from that of
students who mentally reviewed the contents of the passage. We can
conclude, therefore, that for immediate recall the mental review study
technique was the most efficient study method. Further, it appears that the
encoding function served by notetaking is also served by mentally reviewing
each paragraph read.

These effects do not persist in the delayed recall condition. Neither the
notetakers nor the mental reviewers performed significantly better than those
students who merely reread the material on the delayed test. Another function
served by notetaking, according to some researchers (DiVesta & Gray, 1972),
is that of providing a means of external storage of essential information from
the study passage. In this experiment subjects were not allowed to make use
of this opportunity. By providing students review time under the delayed
condition a more typical study situation would be replicated and hence more
definitive conclusions regarding study effectiveness could be made.

In summary, the read-mental review study technique was the most effective
technique for immediate recall when the study time factor is taken into account.
Conclusive evidence regarding study technique effectiveness for delayed
recall was not provided by this study; this is recommended as an area for
further research.



The competencies that junior college chairpersons
expect from their reading course graduates 



Remediation courses in reading and study skills are rapidly expanding within
the junior college curriculum. These courses have been instituted to stem high
student attrition by preparing students to succeed in either regular college
courses or in their areas of vocational interest (Ahrendt, 1975; Sweiger, 1972;
Roueche, 1967).

The diversity in organizational structure of these remedial programs is evident
from a review of the literature in the field (Covington, 1977). Some colleges
have just one reading course while others have as many as four or more
courses. Research in the field of developmental studies (Gwynne, 1973;
Sweiger, 1972) has shown the predominant goal of these courses to be one of
preparing the student for "his years ahead in college" (Sweiger, p. 7).

With such a diversity of courses that aim for student success in regular
college courses or "immediate personal, vocational competence" (Huffman,
1975, p. 4306a), the need arose to ask of reading department chairpersons,
"What, precisely, are the competencies you expect from your reading course
graduates?" The necessity for such a question is apparent. Is there a core of
reading skills that all reading graduates should possess for basic literacy,
personal satisfaction, and minimal vocational competence? Beyond this core, are
there other reading skills in which reading course graduates should be proficient
for success in regular college courses?

A study was conducted to determine what chairpersons considered to be
(1) the major components of junior college developmental reading courses,
(2) competencies expected from reading course graduates, and (3) the areas
in which such courses could be improved. An additional analysis was conducted
to determine whether there were significant differences in the responses given by
chairpersons responsible for one or two remedial reading courses and
chairpersons responsible for three or more such courses.

Method 

Questionnaires were mailed to fifty-nine chairpersons of junior college reading
and study skills programs in Texas. Twenty-eight of these chairpersons
responded. Each was asked to rate twenty-one items according to the importance
of the item in the developmental reading program. A five-point scale from
Highly Important to Unimportant was used for this purpose. Percentage
distributions were obtained for each of the five categories.

Ultimately, these categories were reduced to two: the positive category 
consisting of the responses obtained from Highly Important and Important, 
and the negative category consisting of responses from Somewhat Important, 
Relatively Unimportant, and Unimportant. Although the items cited as Somewhat 
Important could have been positively categorized, the researchers chose to
include in that category only those items clearly delineated as strongly positive 
ones by chairpersons. 
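
The reduction from five rating categories to two can be sketched as follows. The function name and the sample ratings are hypothetical; only the category labels and the positive/negative grouping are taken from the text.

```python
from collections import Counter

POSITIVE = {"Highly Important", "Important"}
NEGATIVE = {"Somewhat Important", "Relatively Unimportant", "Unimportant"}

def collapse(ratings):
    """Return the percentage of positive and negative ratings for one item."""
    counts = Counter()
    for rating in ratings:
        if rating in POSITIVE:
            counts["positive"] += 1
        elif rating in NEGATIVE:
            counts["negative"] += 1
        else:
            raise ValueError(f"unknown rating: {rating!r}")
    total = len(ratings)
    return {key: 100.0 * counts[key] / total
            for key in ("positive", "negative")}

# Invented ratings for a single item, for illustration only.
sample = ["Highly Important", "Important", "Somewhat Important", "Unimportant"]
summary = collapse(sample)
```

Note that Somewhat Important falls on the negative side here, mirroring the researchers' decision to count only strongly positive ratings as positive.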

The binomial distribution (Parsons, 1974) was used to determine critical
values, thereby establishing the groupings of course components designated
as Least Important, Important, and Most Important, as shown in Table 1.
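
One way a binomial critical value can be obtained is sketched below. The report does not give the probability or significance level that was used, so the defaults of p = .5 and alpha = .05, along with the function names, are assumptions made purely for illustration.

```python
from math import comb

def upper_tail(n, k, p=0.5):
    """P(X >= k) for X distributed Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

def upper_critical(n, p=0.5, alpha=0.05):
    """Smallest count k with P(X >= k) <= alpha, or None if none exists."""
    for k in range(n + 1):
        if upper_tail(n, k, p) <= alpha:
            return k
    return None

# Critical count of positive ratings among the 28 responding chairpersons,
# under the assumed p and alpha.
critical_28 = upper_critical(28)
```

An item whose count of positive ratings reaches the critical value would be rated positively more often than chance alone would predict under the assumed p; a matching lower-tail cutoff could be defined in the same way.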

In addition, chairpersons were asked to write an essay in which they
commented on the competencies expected of their reading course graduates
and cited the improvements and amendments, if any, that would enhance the
present reading program.

Finally, in order to determine whether the responses of chairpersons with
smaller departments were significantly different from the responses of
chairpersons with larger departments, the twenty-eight chairpersons were divided
into two groups. Group A consisted of seventeen chairpersons whose programs
consisted of one or two courses. Group B consisted of eleven chairpersons
with three or more courses. An analysis of variance was conducted to
determine whether there were significant differences in the responses of the two
groups to each item.

Results 

Six of the twenty-one reading and study-skills items were deemed Most 
Important components of a reading course. Four of these six items were in the 
area of contextual comprehension. Seven skills were considered to be Important. 
Eight were deemed Least Important (see Table 1).

Junior college reading chairpersons expect the graduates of their courses to be
able to demonstrate competencies in at least four of the course components
designated as Most Important, namely the skills of comprehension in context.

The answers to the question, "What improvements and amendments, if
any, are needed in the program?" were all provided from the responses in the
chairpersons' essays. In the area of program philosophy, chairpersons
recommended an overall improvement in communication between staff and students
so that all understand the importance of reading in everyday life, in subject
area courses, and in career selection. Further, chairpersons suggested that
communication between staff members and remedial students would be
enhanced if the staff members would more frequently approach remedial students
with confidence as people who can become proficient in reading.

For improvement in student self-image, chairpersons felt that spiraling
courses would aid the remedial student better than separate departmental
remedial courses because disciplinary reinforcement and counseling could be
more easily integrated. The courses in the program, they suggested, should be
arranged from basic reading upward, with special multi-disciplinary labs and
learning centers staffed by a team of professionals, para-professionals and
tutors.

Other aspects of the program should include the granting of credit,
pass-fail grading, small teacher-student ratios, non-registration and continuous
enrollment for lab students, and flexibility within the program structure.
The teaching techniques suggested would include diagnostic-prescriptive
methods, continual assessment, independent reading time within the program,
modular lesson planning, group as well as individual (self-paced) instruction,
and more creative teaching.

Other improvements suggested were for more diversified, multi-level,
self-paced, programmed, and ethnic materials. A final suggestion was that
computerized help in selecting materials appropriate for certain deficiencies be made
available to reading and study skills personnel.

There were no significant differences noted in the type of response given
by chairpersons of large departments and chairpersons of small departments.

Table 1

Level of Importance of Reading Course Components

LEVEL OF IMPORTANCE: MOST IMPORTANT, IMPORTANT, LEAST IMPORTANT

CATEGORIES AND COMPONENTS

CATEGORY I: STUDY SKILLS
  A. Listening and Notetaking
     1. Notetaking                                     X
     2. Outlining                                      X
     3. Listening Skills                               X
  B. Research
     1. Dictionary                                     X
     2. Library Skills                                 X
     3. Research Skills for Content Areas              X
  C. Test Taking
     1. Test-Taking Techniques                         X
     2. Test-Questions Analysis                        X
  D. Textbook Study
     1. Textbook Organization                          X
     2. Study Methods (SQ3R, etc.)                     X

CATEGORY II: READING SKILLS
  A. Comprehension
     1. Basic Comprehension in Context
        (Main Idea, Details, etc.)                     X
     2. Following Directions                           X
     3. Advanced Comprehension                         X
     4. Vocabulary in Context                          X
     5. Syntactic Understandings                       X
  B. Conceptual Thinking
     1. Induction                                      X
     2. Deduction                                      X
     3. Concepts (Cause & Effect, etc.)                X
  C. Speed in Reading
     1. Flexible Reading Rate                          X
     2. Speed Reading                                  X
  D. Phonics
     1. Phonics                                        X


Conclusions 





Significantly, the skills deemed Most Important by chairpersons illustrate
their feelings that the major course components (and ultimately, the competencies
of the remedial graduate) are, in the main, the mastery of basic reading
skills. The fact that chairpersons of small as well as large reading
departments gave responses that were not significantly different further illustrates
that certain basic skills are seen across the board as the major components
of remedial reading courses.

Some of the more advanced skills in the areas of "Research" and
"Conceptual Thinking" were cited by chairpersons as Least Important for a remedial
program. The "Conceptual Thinking" skills and the entire list of Study Skills,
however, could be of extreme importance in aiding students for future
college success. Half of the Study Skills items were rated Least Important
with only two of these skills attaining top priority in the reading program. Until
lecturing is replaced by other methods of disseminating content information,
chairpersons may need to teach notetaking and outlining. They can thereby
prepare remedial students to take class notes and rewrite them in orderly
outlines for better intellectual processing.

If the findings of this study truly reflect the expectations that junior
college reading chairpersons hold regarding the competencies of their reading
course graduates, we suggest that they consider moving toward broader
expectations.



1975-1976 review of research in college-adult reading



The last review of research in college-adult reading was published in 1975 and covered the
area through the end of May 1974 (Blanton and Smith, 1975). This review includes research
which appeared in the literature for the two years 1975 and 1976 and covers approximately
June 1974 through May 1976. The reports reviewed cover the published literature directly
concerned with research in college-adult reading except in some instances where allied
references have been included in explanatory sections or discussions of issues which span more
than a short time period.

The articles reviewed have been organized within the following broad conceptual framework:

Surveys of College-Adult Reading Programs
Characteristics and Needs of College-Adult Readers
Evaluation of Programs and Instructional Methodology
Rate and Flexibility
Readability, Materials/Textbooks
Test Development, Evaluation and Use
Word Recognition
Sentence Comprehension
Learning From Prose

Surveys of College-Adult Reading Programs 

Smith, Enright and Devirian (1975) reported results from a nationwide survey of learning
and study skills programs using an all-inclusive definition of learning center. Results indicated
that 61% of respondents were running learning-study skills programs with an additional 9.3%
indicating plans to develop a program. No established pattern of administrative responsibility
has yet emerged and course credit is still a vital issue. Maxwell (1975) presented a case
study sketching the development, program philosophy and operation of a university learning center.

Reading and study skills programs in selected Rocky Mountain colleges and universities
were surveyed by Warren (1975), who found 84% of institutions operating programs with 9%
planning courses. Two-year institutions offered fewer courses than did four-year ones but
served more students. Over three-fourths of the courses were offered for credit, with wide
variation existing in use of machines, materials and instructional procedures. Few instructors
had formal academic training in reading. Programs were aimed more toward minority and
academically troubled students than toward total school populations.

Manikas (1974) analyzed past and present remedial reading programs for underprepared
black junior college students, identified components from 30 successful programs, and presented
a model remedial reading program for such students.

Survey data were analyzed by Haviland (1974) to determine the status of publicly and
privately administered illiteracy programs in Great Britain. Programs have grown rapidly,
particularly since 1967, and awareness of the problem is increasing. However, current provisions
for dealing with the complexity and magnitude of the problem are inadequate.

Kinnamon (1975) reported results of a questionnaire survey sent to 160 commercial TV stations 
in the U.S. and Canada to determine type and number of programs offered which are 
designed specifically to teach adults to read. Program characteristics are described for the few
stations offering them. 

Appreciation is expressed to Donna Dartneii and Catherine Tolsma for aid in collecting and 
organizing data and to Catherine Tolsma for manuscript typing. 

Acknowledgement for funding and released time is made to the UBC Faculty of Education
and the Reading Education Department.

Characteristics and Needs of College-Adult Readers

.. .|. i t- r. -N. 

[...] the reading ability of college freshmen, Larsen, Tillman and
[...] analyzed the results of a decade of testing for all incoming freshmen at the
[...] Florida. As measured by the DRT, reading ability was highly stable for the ten
[...]; even a sizeable increase in admissions requirements (achievement
[...]) did not result in statistically significant changes in reading score trends.

[...] Novai (1974) tested the assumption that entering medical students have
[...] developed adequate reading skills, using NDRT scores for 268 students entering
[...] Medical College [...]. Analysis of scores and careful review of
[...] records revealed that in at least half the cases of academic failure reading deficiencies
[...]. [...] also examined problems of admission and academic
[...] medical students using McGraw-Hill Basic Skills System Reading Test data for 85
[...] at the University of Utah. No correlation was found between reading test scores
[...] in the first two quarters of work in medical school and the decision was
[...] a reading test as part of the admissions procedure.
[...] investigated differential language behavior between vocational and academic
[...] language inventory (free reporting), the DRT, the Otis, Purdue Pegboard,
[...] Aptitude Test Battery for [...] randomly selected subjects enrolled in programs in a
[...] college. ANCOVA covarying on IQ indicated no significant differences in
[...] responses and significant differences in favor of the academic group in reading.
[...] analysis yielded nonsignificant correlations between paradigmatic responding,
[...] measures.

[...] methodology was employed by Wiggins (1975) to gather data on negative
[...] in home and school, intellectual level, interests, and personal and social
[...] Black male remedial reading freshmen in a small Southwest church college. It
[...] that [...] and specific deficits represent a cumulative educational deficit.

[...] the relationship between SES, mode of linguistic expression, attitude
[...] reading, level of aspiration, community environment, age, sex, and IQ with
reading ability [...] using correlational and multiple regression techniques based on scores of 60
[...] college students. IQ made the greatest contribution to reading ability and the
[...] predictors of college achievement were characteristics of the home and reading
[...]. SES variables, attitude, IQ and some of the interest variables were significantly
related to reading skills. Poorer readers' speech represented a restricted code while better
readers [...] the elaborated code.

[...] life change measured by the Social and Collegiate Readjustment Rating
[...], NDRT achievement, course-hour work load, and selected attitude factors were
[...] by Herard [...] as predictors of year-end academic performance for freshmen at
[...] Texas Community College. Descriptive analyses and ANOVA indicated reading
[...] was an effective predictor of GPA, course load and selected attitude factors while

[...] combination of reading ability and life change effectively predicted only
[...].
[...] correlated scores on the Library Orientation Tests, the SSHA and sex with GPA
[...] freshman English students at a small metropolitan university. Intercorrelations among all
[...] of the predictor variables and the criterion measures suggested the tests
[...] relatively independent of each other. The Library Orientation Test appeared to be valid for
[...] success in college. [...] very little variance was added with the use of the SSHA.
Women tended to achieve at a higher level than men and also achieved a higher predicted GPA.
[...], using 500 junior college students, examined the effects of [...] independent
variables on residual reading gain using stepwise multiple regression, t tests and chi square
analysis and found [...] reading potential, number of semesters in reading instruction and
[...] reading ability to be relevant predictors.

[...] of the use of GPA as the criterion variable in characterizing needs and
[...] reading instruction continue to be reported in the literature (Farrell, 1975).

[...] utilized a stepwise multiple regression program to determine the statistical relationship
between college GPA and reading index of freshmen students at Florida junior colleges.
Findings revealed a positive and significant relationship between reading ability and GPA.
[...] evaluated the contradictory research examining the relationship
[...] reading ability and GPA. In research using 120 college students (NDRT and course
[...]) it was found that [...] of the grade variance was accounted for by vocabulary
[...] was related to reading achievement only insofar as reading was
[...] the course. Haselton (1974) used ANOVA and correlational
[...] specific reading and study skills, and faculty perception
of those skills, to GPA in each of 10 schools at Oregon State University. A distinctly
different pattern of reading and study skills emerged for each of the ten schools. Paragraph
comprehension correlated significantly with GPA in 9 out of 10 schools and scores for skimming
and scanning correlated more positively with GPA than any other scores although identified by
the majority of instructors as least important.

Adult Readers -General Characteristics 

DePietro (1975) attempted to predict achievement, attendance and dropout incidence in adult
basic education projects for 397 students in four programs randomly selected from 16 Right to
Read projects in Northeastern urban areas. The predictor variables of enrollment, age, sex, and
formal educational attainment did not predict attendance or attrition; teacher directiveness did
predict literacy achievement.

The oral reading performance of samples of functionally illiterate adults and second and third
grade developmental readers were compared by Russell (1974). Point biserial correlation coefficients
revealed significant similarity of error patterns between the two groups for all categories except
syllabication when reading at the frustration level.

Mizer (1975) compared university students 50 years of age or older with an analogous group
not enrolled in formal education. MANOVA indicated significant differences in life satisfaction,
reading proficiency and interpersonal values but not for selected background characteristics.

Based on a follow-up study of 40 adult males who were clinically diagnosed as dyslexic
in childhood, Frauenheim (1975) reported that results of oral, comprehension and vocabulary
reading tests and arithmetic and spelling achievement measures, administered an average of 10
years | months after diagnosis, revealed that subjects had remained severely retarded in reading
(mean reading grade score 3.6), with concomitant effects on all aspects of life adjustment.

Personality Characteristics 

General. Trachtman (1975) tested 50% of the incoming freshmen class of disadvantaged
minority students entering a special program to determine the relationship of cognitive,
demographic and certain intellectual achievement variables to academic performance. Step-wise
multiple regression analysis revealed reading, attitude toward authority, dogmatism and selected
items of internality-externality to constitute the best combination of predictors, accounting for
3S% of the variance.

Tillman, Mitlot and Larsen (1974) analyzed the results of five studies which explored the
relationship between personality type and reading test scores, hours worked at the reading
center, enrollment in voluntary reading programs, GPA of marginal students and reading test
scores. Two tentative conclusions were reached: persons who state a preference for dealing
with abstractions and symbols tend to be more effective readers; those who prefer to please
other people tend to benefit more from individualized helping situations.

The interrelationship of study skills and personality variables was studied by Rutkowski
and Domino (1975) using data from 201 university freshmen who took the California Psychological
Inventory, the SSHA and the Scholastic Aptitude Test. Results of the factor analysis cautiously
support the independence of study skills, personality variables and aptitude scores.

Hoiberg, Hysham and Berry (1974) replicated a previous study and compared discharge
information on Navy recruits assigned to a remedial training program with that for a control group
to assess the psychiatric implications of illiteracy. The percentage of neuropsychiatric and
disciplinary discharges during the first year of active duty was significantly greater for the
illiterate group.

Treppa (1973), using the Interpersonal Check List, found students who enroll in college
study skills courses to be more self-rejecting and passive, and to perceive their parents more
negatively, than a control group of subjects.

Self-Concept. Musik (1974) analyzed pre and post treatment data for the DRT and a
researcher-developed self-concept instrument for community college students randomly selected from a
reading improvement course and assigned to one of three treatment groups differing in their
relative emphasis on reading or self-concept training. They concluded that the combination of
reading and some training in self-concept was most promising for reading improvement.

Lund and Ivanoft (1974) utilized 227 students to determine the difference in measured
self-concept among college freshmen blocked by reading ability, sex and enrollment in a reading skills
program. All variables, particularly course enrollment, were sensitive to self-concept variation.

Internal-External Locus of Control. Allen and Harshbarger (1977) compared the performance
of internally and externally oriented college reading improvement students taught in a
student-controlled or an instructor-controlled program. ANOVA and pairwise comparisons revealed that
internally-oriented students in classes highly controlled by the instructors made significantly
smaller reading achievement (DRT) gains than did all other students. Drummond, Smith and Pinette
(1975) investigated the extent to which the I-E construct related to student achievement in an
individualized community college reading program. Results for 30 male students revealed greater
achievement for externally oriented students.

Interests

McCreath (1975) assessed reading habits and interests and their relationship to reading
improvement for 89 disadvantaged junior college students enrolled in reading improvement
courses. Generally, students were not regularly involved in reading activities. Newspapers and
magazines appeared to be fairly regular reading, with pleasure reading, discussion of reading and
use of libraries scoring low. Patterns and habits were independent of age.

Several studies assessed various factors related to newspaper readership and readability
among adult populations. Penrose et al. (1974) used a sample of 1,130 randomly selected
respondents in North Carolina to compare the profiles of newspaper and non-newspaper readers
with the results of a study conducted in Wisconsin a decade earlier. Results suggest the same
type of people (low SES, rural, low education) continue to be the non-newspaper readers.
Schweitzer (1976) analyzed young people in a metropolitan area to determine the factors involved
in subscription or non-subscription to two daily newspapers. Chi square tests revealed statistically
significant differences between subscribers and non-subscribers on marital status and occupation;
lifestyle, mobility and educational variables were not related. Bryant, Currier and Morrison (1976)
analyzed a sample of 900 adults from the Toronto metropolitan area to determine if lifestyle
rather than demographic factors relate to newspaper choice, resulting in a new technique for
defining readership.

Attitudes 

Stevens (1975) assessed change scores on a reading test and a semantic differential instrument
for adult students in eight reading classes using a correlation matrix. Significant relationships
between initial attitudes toward reading and reading skills improvement, but not between attitude
change and reading score changes, were noted.

Physiology

Reichurdt (1977) used a polygraph to measure the physiological reactions of reading disabled
students (median age 21, 1st to 30th percentile on the Davis Reading Test) and reported statistically
significant differences between their reactions to oral and silent reading tasks. Gross abnormalities
in respiratory patterns were found to exist among long-term reading handicapped students.
From the standpoint of arousal, two distinct groupings emerged: the lowest scoring readers were
found to be hypo-tensive while those scoring somewhat higher were hyper-tensive.

Wark (1974) compared validly test-anxious with validly test-cool psychology students in terms of
their heart rate patterns while resting and while reading to answer test questions and reported a
clear distinction on heart rate pattern between the two groups.



Evaluation of Programs and Instructional Methodology 

A lively interest continues in the evaluation of program effects and instructional methodology in
college-adult reading, particularly among doctoral candidates. Two-thirds of the research reviewed
in this section was reported in doctoral dissertations.

Romine (1974) polled students and faculty in 29 junior-community colleges in 15 states using
a 70-item instrument designed to assess attributes of an effective instructional climate. Factor
analysis produced an instrument consisting of 41 attributes which make a significant contribution
to instructional climate. Characteristics are grouped under the headings of instructor preparation,
planning and organization, presentation, and assessment and outcomes. Intercorrelations and
reliability analysis reorganized the data further to produce nine major personal and academic
characteristics which typify an effective college instructional climate.

Robin (1976) reviewed the nature of behavioral instruction; its effectiveness in terms of academic
achievement, retention, attitudes, study time, and withdrawals; and assessed the contribution of
various components of the individualized method including self-pacing, unit perfection requirements,
stress on writing, use of proctors, study objectives, grading, lectures, and student/component
interactions.

Fairbanks (1974, 1975) examined 87 reading study skills programs and developed a system of
rating the degree of research control utilized in reports of their effectiveness. Fairbanks concluded
that poor research design and control may well have inflated results concerning reported
effects of college reading programs on GPA. She suggested that variables such as content and
mode of operation of the program, response to program variables, student personality variables,
entry reading skills, scholastic aptitude and achievement, academic curriculum and staff enthusiasm
be studied further. Parham (1975) studied the relationship of persistence in participating in a
college level reading course to six variables including GPA, sex, personality, entrance reading
proficiency, general scholastic ability at entrance, and the type of curriculum being pursued by the
student. Persistent and non-persistent students did not differ on any of the variables.

Burgess, Cranney and Larsen (1976) investigated short and long term effects on GPA of a free,
voluntary noncredit reading improvement program using 46 freshmen as a treatment group and the
entire remaining freshman class (N = 3,000) as a control group. Characteristics of the voluntary
program were similar to those noted for successful programs in the Fairbanks (1974, 1975)
studies. Significant differences in GPAs favored the treatment group at the end of the first
and second terms.

The effects of three treatments (counseling, academic aid, no treatment) on college grades,
reading scores and attrition for 71 low income freshmen were studied by Cochran (1975).
Significant positive relations were found between reading scores and GPA; however, there were no
differences for either NDRT reading change scores or GPA.

Hoiberg, Hysham and Berry (1974) compared 1,517 enlistees who attended an academic remedial
training program with a matched comparison group. Higher values of final reading level, the AFQT,
and educational attainment were all predictive of military effectiveness. The remedial training
program did aid many men to stay on active duty and to perform effectively in the Navy.

Carpenter and Jones (1975) used the NDRT to measure gains in comprehension and rate for
students enrolled in reading classes at Clemson University. Significant gains were found for subjects
taking reading classes. Large gains in comprehension were attributed to the sequential acquisition
of separate comprehension sub-skills and their subsequent integration and application to content
materials.

The influence of reading classes on the reduction of phonic disabilities of first year students
was studied by Lttreh (1976). Experimental and control groups were analyzed consisting of
students enrolled in English and elective reading classes in a Kansas junior college. ANCOVA
results of pre and post scores on the California Phonics Survey showed significant gains for
reading class students as compared to controls enrolled in English classes and significant
gains for the reading classes in comprehension as measured by the Diagnostic Reading Test.

Admission Students. Papandrea (1975) compared self-concept and achievement change
scores of learning laboratory participants who were in the lowest quartile of their graduating
classes with a control group of potential learning laboratory students. ANCOVA, correlational
analyses and t tests indicated mixed results in comparing the two groups on reading achievement,
arithmetic, self-concept and English. No significant relation was found between self-concept and
academic achievement. Rakes, Lmtz and Payton (1974) found, when comparing student achievement
and attitude in applying competency based and traditional teaching programs to four groups of
predominantly Black students, that competency based classes were superior to traditional
classes on the California Reading Test. Student attitude toward competency based teaching was
positive, but the costs of operating such classes were three times those for traditional classes.

O'Donnell (1974) compared academic skill gains of 42 Black high-risk college freshmen
attending a summer program assigned to tutoring in the content of the traditional course or
tutoring (in one hour sessions by college upperclassmen) using a syntactically oriented
psycholinguistic program. MANOVA and t tests indicated no significant differences between the groups in
reading achievement, syntactic maturity of written composition and attitude. However, Whittaker
(1974) found a linguistically based instructional method to be significantly more effective than a
programmed method in improvement of comprehension but not vocabulary for a 16 week reading
and study skills course.

Weiss, using ANCOVA, found differences in favor of a language experience approach
over a traditional approach for total reading and comprehension but not vocabulary. Kahn (1975)
compared reading achievement of students receiving a joint program of applied psychology and
communication skills for two hours three times a week for fifteen weeks with that of students
receiving only communication skills instruction for three one hour sessions each week for fifteen
weeks. No differences appeared on the posttest analysis, although the psychology plus
communication students read more than twice the number of books and experienced changes in attitude.

Smith (1975) compared the gains of 75 junior college students enrolled in a diagnostic/prescriptive
program with a traditional control group. No significant differences were found between reading
gains of the two groups. However, both groups made significant gains during the eleven
week instructional period. Rivera (1974) found that students enrolled in an eight week program
of individualized tutorial assistance in reading and study skills, while failing to attain significantly
higher reading scores or lower attrition rates than control students not receiving tutorial assistance,
did achieve significant improvement in GPA.

ABE Programs. Knowles (1975) compared demographic, personality and achievement data of
students in a state-supported county-wide ABE program with those enrolled in the volunteer
tutor Right to Read program and concluded the two programs served essentially the same highly
variable populations. Personal rather than media recruitment was preferred by participants, and
provision of transportation significantly affected involvement for functionally illiterate adults.
Using a sample of 443 subjects, Lamb (1975), working with adults in four Florida counties
(ages 16-56, reading level 3-8), found that general instruction increased critical reading
ability while maturation in the absence of instruction did not. Hutchison (1974) found that an
experimental program with lessons divided into teaching, practice and application was significantly
more effective than large group instruction utilizing commercial materials over a fifteen week
period on both achievement and attendance measures but not on vocabulary or oral reading.

Spears (1974) found significantly better academic growth and retention for an individualized,
programmed laboratory program than for formal adult education programs involving individualized,
small and large group instruction in self-contained classrooms in neighborhood schools.

Johnson (1975) concluded that CAI programs were effective in ABE in a study involving
seven program components (content, question type, question hierarchy, terminal choice, branching,
timing, feedback) and five learner characteristics (history of failure, negative attitude toward
school atmosphere, immaturity, dependency, suspiciousness).

Tinkle (1974) found significant advantages for massed over spaced instruction using prison farm
inmates, even though both types of instruction were effective with illiterate inmates.

Instructional Methodology

Gnewuch (1974) assessed the effects of a program designed to increase awareness of words
and word meaning based on Dale's principles. Change scores for a 12 week program for 318
experimental and 89 control university reading and study skills students on the NDRT indicated
that teacher training and experience contributed more than ten times the amount of variance
contributed by treatment or classes. Wolfe (1975) found no differences in writing complexity or
usage for three vocabulary development programs: teaching a specific reading vocabulary with
practice of words in sentences, teaching a specific reading vocabulary with practice of words
in multiple choice exercises, and no specific vocabulary instruction. The effects of modified
paperback scanning and vocabulary extension instruction on NDRT (vocabulary) and Carver-Darby
Chunked Reading Test scores were investigated by Baer (1975). ANOVA, using 75 volunteer
undergraduate psychology students, indicated the paperback method was superior for reading
accuracy but not for reading vocabulary, comprehension and rate.

Henderson (1975) analyzed pre and posttest scores on reading rate, vocabulary and
comprehension, self-concept, reading interest variables, and self-expectation for eight intact community and
junior college classes randomly assigned to prescriptive or personalized reading improvement
instruction. Stepwise discriminant analysis indicated no significant pre or posttest differences with
the exception of males in the prescriptive group over males in the personalized group.
Allen (1976) considered three levels of student control (minimum, moderate, maximum) and found
no significant differences in reading achievement (DAT), attitude toward reading (Estes) or attitude
toward the course by treatment group, and no relationship between student personality orientation
(Rotter I-E Scale) and reading achievement in any of the college reading groups. The reading
achievement and interest gains of two year college students taught by a personalized
technique (cognitive field theory) and a prescriptive technique (behavioristic theory) were
investigated by Henderson (1976). No differences favoring either group were found for either dependent
measure. Mahoney (1974) randomly assigned 119 university students taking reading and study
skills classes to a traditional teacher-centered approach, a didactic tutorial approach, a facilitative
tutorial approach and no instruction. ANOVA, post hoc comparisons and chi square indicated
all experimental groups superior to the control group on NDRT scores but not GPA or mean
verbal GPA.

Kistutentz (1976) compared academic course achievement scores, GPA and NDRT change scores
of students enrolled in a community college reading improvement course and assigned to traditional
instruction or sections where vocabulary and comprehension exercises were derived from
composition, mathematics and science texts. Both groups made significant gains in NDRT reading
achievement. There were no differences in grade point averages, reading improvement scores
and achievement in the writing and mathematics courses; however, there were significant differences
in favor of the experimental group in achievement in the science course.

Bam (1974) compared the reading gains of 60 experimental and 60 control students enrolled
in a college reading course who either simultaneously listened to and read SRA materials or read
without tapes. NDRT gain scores using t tests indicated no significant difference by treatment. The
reading comprehension gains of students stratified by reading achievement and assigned to
reading, listening and combined treatments were analyzed by Thomas (1975). ANOVA results for
the original experiment and three replications indicated significant main and interaction effects. The
dual mode was more effective than unimodal presentation and listening more effective than
reading. The interaction suggested that the lower the initial reading level, the more effective
the dual mode.

Gurrola (1975) applied ANOVA to gain scores on teacher constructed tests for treatment
groups taught study methods based on different combinations of SQ3R components and found a
treatment effect for study efficiency and a reading level effect for study effectiveness with a
population of university freshmen.

The effects of proportional amounts of time spent in diagnosis and in instruction on achievement
gain for students were examined by Livingston (1975). Increased diagnosis facilitated achievement
gains for average and below average but not for above average groups following ten weeks of
instruction.

College-Adult Reading Personnel

Phlegar (1975) surveyed the attitudes of faculty and examined effects of an inservice reading
program for faculty in a community college. Results for all faculty members, and responses for
selected students and faculty members, suggested that inservice affected teacher attitudes toward
content area reading but did not change student attitude; female teachers were more conscious of
the importance of reading and more experienced teachers more aware of the complexity of
reading; attitudes of teachers and students were similar and few differences appeared between
academic and occupational-technical teachers; married students were more serious about reading
and their classes; and females enjoyed reading more than males, although males reported
spending more free time reading about the subject.

Schnell (1974) provided a profile of the community college reading teacher based on questionnaire
returns from 47 members of the IRA special interest group for two year colleges. The composite
community college reading teacher was described as being female, forty-five or older, highly
educated and experienced in teaching. She feels her college training was only adequate and that
experience has been more valuable than course work. Ramig and Dulin (1975) described a
one-year reading methods seminar offered by the University of Wisconsin-Madison for adult basic
education teachers, including program goals, administrative considerations, format of the course
and evaluation in which students rated the course and rank ordered, in terms of usefulness, ten
areas of course emphasis.

Rate and Flexibility 

Perception 

In examining factors which contribute to the difference in reading speed among young and
older age readers, Marcel (1974) ruled out visual processing factors such as interfixation eye
movements, number of regressions, fixation time and larger eye-voice span (EVS) and hypothesized
that a prime cause of the difference is increased use of context during a single fixation.
The amount of context used during a fixation is embraced by the "effective visual field"
which can be estimated by degrees of visual angle, typespaces, syllables, and words. Use of
context is affected by nature of the task, subjective criteria, and factors within print materials.
A group of 30 fast and slow adults (19-25) and a group of 48 fast and slow children (10.5-11.5)
comprised the experimental groups. The method of investigation involved simulation of two
successive fixations over a continuous sequence of words with sequential constraints manipulated.
The amount read from the second fixation was the dependent variable. It was concluded that
whatever other factors are operative, a better use of context enables a greater amount of
graphical information to be used in individual fixations, thus reducing the number of fixations needed
to cover the same materials. Increased contextual information facilitates the use of visual
information as well as affecting response bias in word recognition and also appears in so doing
to produce spare capacity for visual processing for use at the more peripheral locations of the
visual field.

Following somewhat the same line of investigation as Marcel, Jackson and McClelland (1975)
tested fast and average college readers, scoring above 70% on a carefully constructed
comprehension test, on four tasks (free report, forced choice, unrelated letters, peripheral span) to
determine if reading speed is primarily sensory or cognitive in nature. Their results indicate that
faster readers are capable of encoding more information from each fixation, suggesting increased
cognitive processing. The availability of more information facilitates the use of contextual
constraints, resulting in more efficient use of partial information. At the same time, the greater
efficiency in encoding of visual information frees limited processing resources to deal with the
conceptual content of what is read and guide the eye to a useful place to begin the process anew.

Rayner (1975) suggested that previous tasks used in determining the size of the perceptual
span were either quite unlike normal reading or interfered with a reader's normal reading pattern
and thus resulted in differing estimates of the size of the perceptual span. Recent on-line
computer techniques involving eye-tracking systems also have limitations, although these more
nearly approximate normal reading. Ten undergraduates were exposed to CRT and on-line
techniques which recorded the duration and the location of each fixation in reading and the movement
time for each saccade. Results suggest the reader is able to make a semantic interpretation
of a word that begins 1-6 character spaces from his fixation point; from 7-16 spaces the reader
picks up gross visual characteristics such as word shape and initial and final letters. It seems
clear that the capability of distinguishing between a word and a non-word, or making a semantic
interpretation, is operative only within the area of high visual acuity, a rather small range for the
effective stimulus. Other characteristics can be acquired further into the periphery to a maximum
of five degrees of visual angle to the right of the fixation point. The overall results suggest
slightly larger perceptual span and peripheral areas than reported in previous research.

Edfeldt (1975) examined the old question of whether or not it is possible to read during
saccadic movements. Current theory is reviewed which suggests that reading (seeing) takes
place during fixation and that the reader is functionally blind during saccadic movements.
Using newly developed apparatus to assess saccadic seeing, Edfeldt suggests that the good
reader utilizes the ability to read and comprehend even whole words during saccadic movements,
affording more exposure time to materials than the poor reader. It is still to be determined
whether saccadic seeing is amenable to training or if it is part of the individual's Central Nervous
System blueprint.

Factors Affecting Rate and Flexibility

McConkie and Meyer (1974) suggest that task structure variables have a substantial effect on
the behavior which subjects exhibit during a reading situation. Using 70 undergraduate subjects,
and passages with MC questions built on higher level factors (Bloom), they investigated the
influence of four variables (payoff structure, form of payoff structure, type of payoff, presence or
absence of feedback) on reading rate and comprehension. The existence of a payoff structure, in
and by itself, had little effect, but the form of the structure produced significant changes in
reading rate. Feedback appears to be a critical variable in producing reading strategy changes.

Samuels and Dahl (1975) used 1.100 fourth graders and 84 college students in a modified
Latin square design to determine the effects of establishing appropriate purposes on flexibility
and reading rate. Subjects read a prose passage and answered detail and general overview
questions. Repeated measures ANOVA revealed significant differences in reading speed when
subjects knew beforehand the kinds and difficulty of the questions they would be asked; both
groups scored significantly faster for overview than for detail questions.

Sullivan (1975) analyzed pre and post treatment scores on an adapted version of the Reading
Adequacy READ Test for a one-week experimental flexibility module using university
undergraduates and a control group. ANOVA and correlation revealed no significant relationship between
rate and reading materials, factors of general reading ability and reading flexibility.

Brandt (1975) investigated the effect of two reading rate instructional approaches (SRA
mechanical versus motivated), using Ohio State University students participating in a reading
improvement course, in relation to internal/external (Rotter I-E) locus of control orientations.
Single factor ANOVA revealed groups receiving motivated training increased rate significantly
more than the non-motivated treatment. The correlational analysis between personality type and
reading performance was significant only for the control group.

Coke (1974) evaluated the usefulness of reading rate as a measure of difficulty in two
studies relating to text-derived measures of readability. Oral reading rate was affected by readability
since subjects read at a standard syllable rate. When subjects rated each passage for
comprehensibility, correlation of comprehensibility with readability indicated that subjects were sensitive
to readability, suggesting there are limitations in using reading rate as a measure of reading
difficulty.

Carver (1976) studied the functional relationship between word length, passage difficulty and
reading rate using 300 passages sampled from curriculum materials ranging from grade one
through graduate level. Passage difficulty was measured by the Rauding scale. Reading rate of 83
college subjects decreased in relation to passage difficulty, but when measured in standard words
per minute reading rate was constant. The results are discussed in relation to current theory
relating redundancy to reading rate and eye movements.
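
The distinction between actual words per minute and standard words per minute can be
illustrated with a short sketch. The review does not define a standard word; the code below
assumes the convention of six character spaces (letters, punctuation and blanks) per standard
word, and the two passages and the reading time are invented purely for illustration.

```python
def actual_wpm(text: str, seconds: float) -> float:
    """Rate in actual words per minute (whitespace-delimited words)."""
    return len(text.split()) / (seconds / 60.0)


def standard_wpm(text: str, seconds: float, chars_per_word: int = 6) -> float:
    """Rate in standard words per minute.

    Assumption (not stated in the review): one standard word equals
    `chars_per_word` character spaces, counting letters, punctuation
    and blanks alike.
    """
    standard_words = len(text) / chars_per_word
    return standard_words / (seconds / 60.0)


# Hypothetical passages of equal length in character spaces (600 each).
easy = "a cat sat " * 60                    # 180 short words
hard = "international agreements " * 24     # 48 long words

# Hypothetical reading time: two minutes for each passage.
for name, passage in (("easy", easy), ("hard", hard)):
    print(name, actual_wpm(passage, 120), standard_wpm(passage, 120))
```

With these invented inputs the actual-word rate falls on the passage with longer words while
the standard-word rate is identical for both, which is the pattern the summary attributes to
the study; the numbers themselves carry no empirical weight.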

Wood (1976) assessed the comprehension of general and study materials presented at various
generated speech speed rates to 208 post-secondary technical students stratified by ability.
ANOVA and post hoc comparisons revealed significant differences by reading ability in
comprehending main ideas, details and conclusions, and by rate for details. Seventy-five percent of the
subjects preferred a compressed speech rate of 200-250 words per minute.

Applying the Modified Packard Method of discourse analysis to recalls of readers trained
in the Evelyn Wood method, Hansen (1976) found varied differences between slow and rapid
readers: when time but not amount of material was held constant, slow readers showed
significant gains in recall of operations; with time and amount of material held constant, rapid
readers showed significantly greater gains on total operations.
' e3 K *X oVo and Duim ,1974. described development of a model for evaluating cer'am aspects 
r<WkOOOk*s wh.ch should be considered when developing or selecting texts to be used in 
m" v mHate o« comprehension factors include difficulty of material, difficulty of questions. 
M. K.;f-ii,n<3 me mended audience and the reader's purpose 


Readability and Materials/Textbooks 

Miller (1974) compared the predictive performance of the Flesch (multiple-choice-based) and
the Bormuth (cloze-based) readability formulas using the McCall-Crabbs Standard Lessons in
Reading and the 1967 Miller-Coleman Readability Scale as criterion measures. Findings confirm
the contention of cloze proponents that cloze-based formulas generate higher validity coefficients
on cloze materials than multiple-choice-based formulas generate on multiple-choice materials.
However, Miller suggests that results may be due to the inherent nature and mechanisms of
the scores generated by cloze procedures rather than demonstrating the intrinsic superiority of
the cloze. Liau et al. (1976) reported the development of four readability formulas based on
cloze procedures which expand upon the earlier work of Coleman. The formulas, which account for
an even higher percentage of the variance, were derived by choosing a more reliable criterion
and a better selection of predictors selected according to ease of scoring as well as percentage
of variance explained. A cross-validation study supported the development of the four formulas.
Coleman and Liau (1975) maintain that there is no need to estimate syllables for computer
readability analysis since word length in letters is a better predictor of readability than word
length in syllables. Cloze was used to calibrate criterion materials. A new readability formula
was generated with letters per 100 words and sentences per 100 words as the predictors.
Both can be economically estimated with optical scanning equipment. Estimated grade level
equivalents for cloze scores are also presented. McNinch et al. (1974) tested the effects of
varying deletion schemes in cloze estimates of college texts in science, mathematics and social
studies using 184 college freshmen from a psychology course. Deletion schemes should be
adjusted according to discipline studied: science should have a low deletion ratio while
social studies and English can tolerate a more moderate (every seventh word) ratio. A moderate
deletion ratio was also tentatively suggested for math materials.
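
A formula built on letters per 100 words and sentences per 100 words, as Coleman and Liau
describe, can be sketched briefly. The coefficients below are those commonly cited for the
Coleman-Liau index and are an assumption here, since this review does not state them. The
function name and the crude regular-expression splitting of words and sentences are
illustrative only.

```python
import re


def coleman_liau_grade(text):
    """Estimate a grade level from letters and sentences per 100 words.

    Assumes the commonly cited coefficients (0.0588, 0.296, 15.8);
    they are not given in the review itself.
    """
    # Words are runs of letters; sentences are split on ., ! or ?.
    words = re.findall(r"[A-Za-z]+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")

    letters = sum(len(word) for word in words)
    letters_per_100 = 100.0 * letters / len(words)
    sentences_per_100 = 100.0 * len(sentences) / len(words)
    return 0.0588 * letters_per_100 - 0.296 * sentences_per_100 - 15.8


if __name__ == "__main__":
    sample = "The cat sat on the mat. The dog ran."
    print(round(coleman_liau_grade(sample), 2))
```

Because only letters and sentence boundaries are counted, no syllable estimate is needed,
which is the economy the authors argue for.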

Using six analysts, Thomas, Hartley and Kincaid (1975) estimated test-retest and interanalyst
reliabilities for the Automated Readability Index, the Flesch Reading Ease Score and the Fog
Count. All coefficients, with the exception of one Flesch measure, were above .94. ANOVA applied
to measured working times indicated that the Flesch takes significantly longer to use than either
the ARI or the Fog. Blumenfeld, Black and Morris (1975) investigated the interrater reliability
of the Gunning readability index. The correlational analysis suggested that while the results
of the two independent ratings were not identical, the interrater reliability was of sufficient
magnitude to be considered adequate. The means exhibited a slight constant error, suggesting
training to decrease interrater error. Blumenfeld and Justin (1975) used fifty random samples
of reading material and obtained highly comparable readability ratings by Flesch and Gunning indices.
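
For orientation, two of the indices compared above can be computed from simple counts. The
sketch below uses the commonly published forms of the Automated Readability Index and the
Flesch Reading Ease score; the constants are assumptions in the sense that this review does
not reproduce them, and the function names are illustrative. The Fog Count is omitted
because its scoring rules are not described here.

```python
def automated_readability_index(characters, words, sentences):
    """ARI from counts of characters, words and sentences (commonly published form)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 4.71 * characters / words + 0.5 * words / sentences - 21.43


def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease from counts (commonly published form).

    Higher scores indicate easier text. Counting syllables by hand is the
    slow step, which fits the longer working times reported for the Flesch.
    """
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * words / sentences - 84.6 * syllables / words


if __name__ == "__main__":
    # A hypothetical 100-word sample: 5 sentences, 500 characters, 150 syllables.
    print(automated_readability_index(500, 100, 5))
    print(flesch_reading_ease(100, 5, 150))
```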

Patterson (1974) analyzed the relationship of selected intersentence factors to cloze
comprehension scores of passages from different subject-matter areas to determine if intersentence
factors add predictive power to the Dale-Chall readability formula when cloze scores serve as
the measure of comprehension difficulty. Cloze tests (seventh word deletion) were constructed
and Carroll's and Yule's type-token formulas used to determine the diversity of lexical units and
words in the passages. The verb/adjective ratio and the noun-verb/adjective-adverb ratio were
used as measures of passage natural language content. Correlation, stepwise multiple regression
and ANOVA revealed that diversity of lexical units as measured by Carroll's and Yule's
formula, and diversity of nouns as measured by Carroll's formula, were significantly related to cloze
comprehension scores. For these subjects and materials, new readability formulae emphasizing
lexical density should be considered.
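
As a rough illustration of the lexical diversity measures named above, the sketch below
computes a corrected type-token ratio in the form usually attributed to Carroll (types
divided by the square root of twice the tokens) and Yule's characteristic K. These forms
are assumptions: the review does not say which variants Patterson used, and the function
names are hypothetical.

```python
import math
from collections import Counter


def carroll_cttr(tokens):
    """Corrected type-token ratio: types / sqrt(2 * tokens)."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    return len(set(tokens)) / math.sqrt(2 * len(tokens))


def yule_k(tokens):
    """Yule's K: 10^4 * (sum(i^2 * V_i) - N) / N^2.

    V_i is the number of types occurring exactly i times and N is the
    number of tokens. Higher K means more repetition, so less diversity.
    """
    if not tokens:
        raise ValueError("tokens must not be empty")
    n = len(tokens)
    type_frequencies = Counter(tokens)
    spectrum = Counter(type_frequencies.values())  # i -> V_i
    second_moment = sum(i * i * v_i for i, v_i in spectrum.items())
    return 1e4 * (second_moment - n) / (n * n)


if __name__ == "__main__":
    passage = "the cat saw the dog and the dog saw the cat".split()
    print(round(carroll_cttr(passage), 3), round(yule_k(passage), 1))
```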

The cloze performance of 120 community college freshmen was analyzed by McWhorter (1974)
to determine the influence of passage organizational structure on comprehension and two
estimates of readability. Four passages representing time order, cause-effect, comparison-contrast
and statement-support organization were used. The Bormuth and the Dale-Chall readability
formulas were applied to the passages and a fifty item cloze test was constructed for each
passage. Mixed results indicated the Bormuth formula provided a significantly higher estimate
of combined passage difficulty than did cloze scores; observed cloze means were significantly
higher than predicted cloze means for all but the time order passage; the time order passage
was significantly more difficult than the other passages; and the Dale-Chall estimate of combined
passage difficulty was significantly higher than the Bormuth estimate.
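
Several of the studies in this section rely on fixed-ratio cloze tests. A minimal sketch of
how such a test might be built and scored follows, assuming every nth word is deleted
counting from the start of the passage and that only exact replacements count as correct.
The function names and the blank marker are hypothetical, and punctuation attached to a
deleted word is removed along with it.

```python
def make_cloze(text, every=7, blank="_____"):
    """Replace every `every`-th word with a blank; return the test and the answer key."""
    if every < 1:
        raise ValueError("every must be at least 1")
    cloze_words, answers = [], []
    for position, word in enumerate(text.split(), start=1):
        if position % every == 0:
            answers.append(word)
            cloze_words.append(blank)
        else:
            cloze_words.append(word)
    return " ".join(cloze_words), answers


def score_exact(answers, responses):
    """Count responses that match the deleted word exactly (ignoring case and spaces)."""
    return sum(
        answer.lower() == response.strip().lower()
        for answer, response in zip(answers, responses)
    )


if __name__ == "__main__":
    passage = " ".join(f"w{i}" for i in range(1, 22))
    test_text, key = make_cloze(passage, every=7)
    print(test_text)
    print(key)
```

Changing `every` gives the lower or more moderate deletion ratios discussed for different
disciplines.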

Morrow (1974) rewrote college level expository prose to determine if easier readability would
aid comprehension of low achieving community college students. Material was adjusted by
reducing difficult vocabulary, shortening sentences, and a combination of the two. Cloze tests
were constructed for the original and rewritten passages and materials presented to three
experimental and one control group of students from remedial reading classes. The findings support
previous research in demonstrating that reduced readability aids comprehension. English ACT
scores, I.Q., and Vocabulary and Comprehension scores had significant curvilinear relations with
the cloze criterion scores.

Bankston (1975) applied Kane's readability formula for mathematics material and rewrote one
mathematics lesson at three levels of difficulty. When presented to remedial mathematics
students, the group receiving the lowest readability version had significantly better post test
achievement than the other two groups.

Bassin and Martin (1976) and Bassin (1976) tested the effects of three redundancy reduction
techniques on reading comprehension, reading time and rate. A news article was reduced 10%,
30% and 50% by one of three reduction techniques: word frequency, grammatical, and subjective.
ANOVA and post hoc comparisons indicated reduction method had no significant effect on
comprehension except for the 50% level where the subjective method produced better reading
performance. There was a decrease in speed in reading the reduced versions of text.
Herndon (1976) examined the role of redundancy in textual processing and, using methodology
from computer science and education, generated a system for computer control of redundancy.

Klare (1975) evaluated 36 experimental studies which examined the effect of readability of
materials upon comprehension and/or retention. In each of the studies reviewed, materials
were selected, the readability altered, the material presented to subjects, and the effect on
comprehension and/or retention estimated. In 19 studies, altering the readability of the material had
a significant effect on comprehension and/or retention; in 6 studies the effect was mixed, and
in 11 studies the effects were nonsignificant. Comparisons among studies were made to
elucidate factors which interact with readability including reading performance measures, the test
situation, reader motivation, readability considerations in materials, content considerations, and
reader competencies. Attention to such factors could have helped to reduce or increase the
probability of significant results. Klare concluded that many writers succeeded in writing materials
readable enough for readers to increase their comprehension and/or retention but the exact
variables operating in the process are not clear. Tables are presented listing the major
variables thought to have an influence on readability.

Boston (1976) used three groups of 95 university students to test the effects of line
width on reading speed and comprehension in brochures and booklets. ANOVA revealed
one-column material to be significantly superior to two-column material for both speed and
comprehension, contrary to previous legibility research findings.

Based on the assumption that existing readability indices lack validity in accurately judging
the concept or ideational difficulty of passages, Carver (1975-76a) developed the Rauding Scale
of Prose Difficulty (encompasses reading and understanding) as a tool for using subjective
judgments to scale difficulty of passages. The RS consists of six anchor passages (from Bormuth,
1969) judged to be at the grade 2, 5, 8, 11, 14, and 17 levels of difficulty. New material
is compared to the anchor passages and the opinions of three "experts" (achieved the criterion
of judging five passages correctly) averaged to obtain a rating. Evidence from three studies is
presented to support the reliability and validity of the method including: psychometric and
interrater reliabilities; validity coefficients in predicting Bormuth grade levels; using RS to estimate
grade level of reading materials commonly used at various levels; and prediction of passage
difficulty using the National Reading Standards Test developed by the author. Kibby (1975-76)
subsequently pointed out that the idea of an anchored set of judgmental passages is a sound
one but that the validity of the Rauding Scale is questionable on a number of points including:
lack of connection with previous readability research; poor definition and lack of a theoretical
rationale for idea or concept difficulty; grade level discrepancies in the anchor passages;
problems with rater underestimation of passage difficulty; faulty study of sample reading materials;
and inadequate sampling, poor control, skimpy or inaccurate interpretations of data in the three
studies. Kibby concludes that the scale adds nothing to our understanding of comprehension,
it is not supported as a readability measure, and it provides no clear guidelines for rewriting
materials. In a response to Kibby, Carver (1975-76b) presents detailed counterarguments on
the points raised and contends that, despite the criticisms, the Rauding Scale is able to assign
grade levels with substantial validity. The RS seems to offer the best "mousetrap" for
estimating concept difficulty of passages at the present time.

Materials/Textbooks

Haskins (1975) described the development and assessed the concept validity of a technique
for pretesting interest in printed messages. A number of studies are described which support the
position that use of short descriptive verbal concepts is predictive of interest in the whole
message. Haskins further discussed the technique as an efficient and inexpensive alternative to
complete curriculum review.

Booher (1975) compared the relative comprehensibility of pictorial information and printed words
in instructions. Six picture-word formats were examined (print only, pictorial only, pictorial
related print, print related pictorial, pictorial redundant print, print redundant pictorial). Results, using
90 Navy enlisted personnel, showed pictorial information important for speed but print information
necessary for accuracy, with comprehension most efficient with pictorial related print and pictorial
redundant print formats. Kammann (1975) tested the relative comprehensibility of flowcharts
versus standard printed instructions using 90 scientists and engineers, 90 suburban housewives,
and Bell telephone employees in a follow-up study. Two different flowchart formats were
compared with standard paragraph instructions and were found to be superior in comprehension,
accuracy and speed. Comprehension for printed directions appears to follow a general rule of
thumb: directions are understood correctly about two-thirds of the time.

A number of studies analyzed the difficulty level of college textbooks in relation to the reading
ability of students using them. Smith (1975) applied the Flesch formula to 162 one hundred word
samples from nine accounting texts (3 high school, 3 vocational and 3 college level). ANOVA and
post hoc tests revealed differences by level and for level by publisher but not for publisher
alone. Mean readability scores increased in difficulty by level and were considered below the
grade level for students for whom the texts were written. Nilagupta (1975), using a sample of 213
university students (Bangkok) enrolled in English as a second language reading classes, found a
relationship between reading ability (Nelson and an informal test) and text readability (Flesch).
The four textbooks had readability levels above the mean grade equivalent comprehension
scores of students. By contrast, Slovak (1976) found no relationship between the reading
ability (NDRT) of 760 community college students and comprehension of texts scaled by readability
level (Dale-Chall, Flesch). Kyrzman (1974) found that the reading ability of open-admissions
college freshmen averaged 10.5 while the readability of their textbooks averaged 14.0.

Working with adult materials Rakes (1973) assessed the readability of 29 units of material 
selected from books frequently used in ABE instruction using Dale-Chall, Fry and SMOG indices. 
The resulting readabilities were compared with difficulty judgments made by 43 ABE teachers 
with discrepancies noted between the formula levels and the teacher estimates. 

Pyrczak (1976) found that 100 word passages of the instructions accompanying income tax
returns yielded a readability of 9-10 using the Dale-Chall formula. He composed a set of
comprehension questions on the standard instructions and a set for a revised simplified version. On the
first test graduate students in education produced a mean score of 5.65, functionally illiterate. The
simplified set of instructions produced a mean of 7.65.

Test Development, Evaluation and Use 

Various techniques continue to be proposed for the measurement of reading behaviors,
particularly in the area of comprehension of prose materials. The fundamental problem in such
measurement has always been to develop techniques which enable one to estimate how much has
been comprehended in reading and how accurately it has been comprehended. The inherent
difficulty arises from the fact that the processes one is attempting to measure are internal and
cannot be directly observed; they can only be inferred from observations of sampled overt
behaviors that accompany the internal processes.

The research reviewed in this section is organized in two parts. First, studies are reviewed
which report development of tests of various types. Then studies which assess characteristics
and uses of existing tests are reviewed along with studies which discuss various desiderata
relevant to psychometric properties of tests and their use.

Test Development

Anderson (1974) examined the possibility that cloze measures are a function of content
achievement among adult learners. College juniors and seniors took cloze tests before and after
studying a prose passage under five conditions: mc, reproduction passage cloze test, recognition
passage cloze test, reproduction summary cloze test, and recognition summary cloze test. Post
instruction mean scores were reliably higher than preinstruction scores in virtually all testing
conditions, lending support to the proposition that cloze measures can serve as indices of
achievement.

Ohnmacht and Fleming (1974) replicated and expanded work on analyzing content effects and
complexity of structure effects with three types of verb deletions and two types of noun
deletions in cloze test development. Results with undergraduates generally confirmed earlier
studies that suggest verb types do not lend themselves to categoric rank ordering from lowest to
highest difficulty of structure. An additional finding concerned the "devastating effect" of the
absence of post-deletion context on subjects' performance with all three verb classes and, to a
lesser degree, with noun classes.

Edwards (1974) examined the reliability and difficulty level of cloze tests of poetry using 600
university freshmen composition students. Results suggest: deletion pattern did not significantly
affect reliability; no significant difference between reliability for cloze and for a Poetry Reading
Test (STEP Reading Series II) on the same passage; choice of deletion pattern affected
difficulty level; number of responses required did not affect reliability; and printed format had no
effect on reliability.

Saiup (1975) investigated cloze test performance and relationships between the Coop English
Test-Reading and cloze scores using 776 university freshmen and reported correlations of
cloze with reading comprehension of .36 and .63; KR20 reliability of p6 antNCa^ljeliability of
41. Good readers had higher scores than poor readers on use of correct grammar and total cloze scores.
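
For readers unfamiliar with the KR20 coefficient reported above, the following sketch
computes it from a matrix of right/wrong item scores. It assumes dichotomous (0 or 1)
scoring and uses the population variance of total scores. The function name and the small
data set are illustrative and are not taken from the study.

```python
def kr20(item_scores):
    """Kuder-Richardson formula 20 for dichotomously scored items.

    item_scores: list of examinee rows, each a list of 0/1 item scores.
    KR20 = k / (k - 1) * (1 - sum(p * q) / variance of total scores)
    """
    n_examinees = len(item_scores)
    if n_examinees == 0:
        raise ValueError("need at least one examinee")
    n_items = len(item_scores[0])
    if n_items < 2:
        raise ValueError("need at least two items")

    totals = [sum(row) for row in item_scores]
    mean_total = sum(totals) / n_examinees
    variance = sum((t - mean_total) ** 2 for t in totals) / n_examinees
    if variance == 0:
        raise ValueError("total scores have zero variance")

    # p is the proportion passing each item; q = 1 - p.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in item_scores) / n_examinees
        sum_pq += p * (1 - p)

    return n_items / (n_items - 1) * (1 - sum_pq / variance)


if __name__ == "__main__":
    scores = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
    print(kr20(scores))
```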

Lowry and Marr (1975) tested and supported the construct validity of Darnell's cloze entropy
procedure.

*Coy (1975) analyzed data for 38 functionally illiterate adults in assessing criterion and construct
validity and reliability for an Adult Reading Rate Test. Correlations between the ARRT and
reading gain scores were deemed insufficient (.60) for use of the test as a predictor of an
individual's specific performance score; teacher judgment was inadequate as a measure of the
rate of mastery of basic skills; factor analysis yielded only one of the five ARRT traits
hypothesized in constructing the test; and the reported reliability for the test was .81.



•The development ar.d field testing of an instrument deemed appropriate tor the domain ol 
on ihtiob 2 found to be essential. ,n entry-level positions of selected plants of a large 
epoM^l oy Schoenfeldt et al (1976) Reading skills found to be importan ,n 

SUs,Za,«.ials y WWe ,den„„ed and related to the types > of. ^tenals read on the fob 
A criterion referenced' instrument was developed to sample the conten ioma.n < «rth doMPJ 
cedure used to Atabhsri the criterion pass-fail cutoff in addition to the usu al 75 * /'^g' 0 ^ 
passing level Dale-Chail readabilities were also computed for passages Content validity pro- 

college students reading passages at four levels ol dilleulty in the "» " d b 

experimental conditions replicated those used in the 1974 ^ study ^ Sveteit , or measuring 
u», aopea.s » M ess sens*™ o ™ P.™, e«c.j ol " ™=\™«„ "''J* 150mo , p »c 
,« measure comprehension of those concepts The ^ a xec^niqae 



each unit 
generated 



uninteresting, only 7* of items were not attempted by a ^'75 % oMh that 

One problem m using cloze techntq^clam Ca rstens and McKeag < >• da , a 

students may become discouraged when test resu Its seem ' ow ! M gnd , doze 

-feedback from more conventional assessment p»o e du ^ h s e y, a h n a a ^~ ^ ring Suited in 

■S^JSK p =e ° « to be used in ev " a,ing 

"San ,,974, found a correlative. .61 between scores ™ ^& < ^« B £S 2 
• presented for the tests olrtP10 H riividP the Brown-Holtzman SSHA into two
battery with respect to their specific value as placement tests for students entering an open
admissions community college. Factor analysis results for the 1,534 students revealed all reading
tests loaded highly on the same factor, as did the grammar tests of the CET and the
CGP batteries.

Williams (19/fcj) used five methods of measuring frustration to compare college students'
degree of frustration on informal reading inventories. Use of a multitrait-multimethod matrix
indicated: no physiological variables could definitely discriminate frustrating from nonfrustrating
reading material; difficulty level of materials appeared to affect comprehension more than
pronunciation; Betts' criteria and self-rating scales tended to measure the same aspects of reading
frustration and were moderately related; none of the methods appeared to definitely discriminate
among frustrating and nonfrustrating reading passages or measure degree of frustration with a
high degree of validity; and Rose transformed GSR and heart rate were the most reliable of
the five measures.

Hakstian and Cattell (1974) discuss results of their factor analysis of 57 variables from ability
tests administered to 343 subjects ages 15-55. The report includes discussion of the first-order
factoring only. The common-factor analysis revealed 19 intercorrelated primary abilities ranging from
such previously well established constructs as Verbal and Numerical ability to less well-understood
factors in the memory and fluency-originality areas.

Ohnmacht and Weiss (1976), using a computer simulated exemplar, considered various models of
assessing construct validity and suggested that a third dimension, that of time of assessment, be
added to the two dimensions currently being assessed in construct validity studies, based on
the supposition that learning to read is not a static proposition but a dynamic developmental
process.

Pyrczak (1975) reported that results of a number of studies indicate a substantial number of
items in standardized reading tests lack passage dependency and presents results from a brief
study to confirm the observation. Preston's 1964 study was replicated by Keetz (1975) by
having 118 freshmen answer reading test questions without having first read the text. The findings
confirm the earlier work showing better than chance performance on questions and a positive
correlation between test scores and general reading ability.

Word Recognition

Studies from the word recognition literature are reviewed which touch upon various topics
related to older readers including the role of phonemic, graphemic and semantic mediation in
lexical access and word recognition; developmental changes in word recognition strategy;
processing operations in word recognition; the role of units smaller than single words; and the
effect of various linguistic and affective attributes of words in lexical access.

A number of recent studies have examined the role that phonemic and graphemic codes play
in accessing lexical meaning of words. Several studies suggest less skilled readers (poor decoders)
may also be poorer comprehenders (Golinkoff and Rosinski, 1976; Perfetti and Hogaboam, 1975;
Golinkoff, 1975-76; Pace and Golinkoff, 1976; Levy, 1975) and that phonemic recoding is an
extremely important skill for both younger and older readers. Studies also suggest a tolerance
for weaker versions of phonological recoding and entertaining a dual access hypothesis with both
graphemic and phonemic information playing a role in accessing lexical meaning (Baron and
McKillop, 1975; Kleiman, 1975; Meyer, Schvaneveldt and Ruddy, 1974; Bradshaw, 1975).
Skilled readers do tend to rely less on phonemic recoding.

Samuels, Begy and Chen (1975-76) compared the word recognition speed and processing
strategies of less able and more highly skilled children and adult subjects. Results indicated
that more fluent readers were faster in word recognition, superior in ability to generate a
target word given context and minimal cues from the target, and superior in awareness when a
false recognition was made. The results are interpreted as support for the efficacy of training
in word recognition strategies. The research of Eriksen and Eriksen (1974) confirmed the
importance of first letter position and left-to-right processing order in word recognition and
supported the fact that instantaneous decoding of the sound-spelling pattern of the language
typifies proficient readers.

Green and Shallice (1976) asked whether phonological encoding is an obligatory stage in
reading or whether direct access to a word's semantic representation is the general rule.
With results from adult skilled readers, they argue that meaning may be accessed directly from
visual form and does not normally involve prelexical phonological recoding; phonological effects
occur after lexical access rather than before. They point to problems in the use of the lexical
decision task in such studies. Kleiman (1975) suggests that phonological recoding may play an
important role beyond the individual word level in printed discourse by providing the reader with
a strategy for maintaining in memory the words of a sentence long enough for the sentence
to be comprehended.

Fleming (1976) presented skilled adults with nonsense words in three groups in two studies
to test the phonemic recoding versus the graphic encoding alternatives to lexical access.
Results suggest there are alternatives to phonemic recoding and that the graphic encoding
hypothesis may be more appropriate. The important point is made that the coding process
employed may be contingent upon the task required in various situations. Bradshaw (1975)
suggests that the mature reader of relatively simple material seems capable of extracting much of the
semantic content of a passage without going through a necessary prior stage of
recoding and concludes that an interactive model of word recognition may best explain the
process with perceptual units larger than the individual letter.

Mason (1976) compared the results of word-nonword recognition tasks and found that children
and adults were alike in being affected by word familiarity and interaction between initial

5,^m negtenc and wora famman.y Children were also affected by final consonant frequency 
W^wS^BlewraJei regularity and word familiarity Skilled readers can recognize 
words without extensive orthographical analysis. Unskilled readers may use the same
processes but revert to phonological encoding when a word is unfamiliar and contains an irregular

"sSwarn (1975) examined whether performance m recognition memory tasks depends on the . 
recognition of a specific interpretation of a word (as originally encoded) or upon its orthographic-
phonemic representation. A list of common words paired with weakly cued ranwortto was used.
The data suggest the recognition of a word depends on its physical representation as well as
its meaning. Under certain circumstances, phonemic and orthographic information may be as
useful in a recognition task as the particular meaning of a word.
Rayner (1976) used a population of children and adults and demonstrated a clear developmental
effect such that, with increasing age, subjects were more likely to choose alternative words
with the same shape as the stimulus words in artificially constructed word trigrams, quadrigrams
and quingrams. Heavy reliance was also made on the first letter in alternative choices
through the fourth grade. The results support Gibson's hypothesis that there is a developmental
change with increasing age in feature analysis and extraction. Kindergarten children do not have
an efficient strategy in dealing with the delayed recognition task; second through fourth grade
children look for alternatives with the same first letter; fifth and sixth graders continue this
but also look for alternatives with the same shape; adults rely much more heavily on shape
than first letter. Word shape, when it is conceptualized as being made up of distinctive
features which make up the letters of the word, is an important variable in word recognition.

Lazerson (1974-75) found that the skilled reader of Modern English can tolerate noise in the
featural display (line drawn through print). However, highly variable spelling has a more detrimental
effect on measures of reading performance. Featural information serves as the catalyst which
enables reading to occur, but the skilled reader relies upon orthographic and syntactic-semantic
information stored in the brain, demonstrating that skilled reading performance is a highly
cognitive process rather than being simply a phonemic or visual activity.

Santa (1976) attempted to determine whether whole words, single letters or spelling patterns
operate in the recognition of isolated words. Using younger readers and adults, it was concluded
that the development of reading maturity appears to correspond to more flexible word recognition
strategies and an increased dependence on larger units of analysis.

Studies of the reading process have shown that skilled readers are able to extract the meaning
of text without direct reference to the appearances or sounds of individual words; words can
arouse their logogens without phonological mediation and attention to words can also be
minimal prior to their recognition. Underwood (1976) tested the effect of unattended printed
words upon a picture-naming task to determine if print can be read even when it is not
attended to. Findings suggest attention may not be necessary for word recognition provided that
the thresholds of words are lowered by current contextual information. If attention is highly
selective then only semantically related items will affect performance.

Forster and Bednall (1976) tested the assumption that lexical access involves a search process.
In one experiment test items were classed as ambiguous or unambiguous and in another they
were classed according to their syntactic properties. In both experiments, when the target of the
search was a nonexistent entry, an exhaustive search was involved. Frequency of occurrence
was no longer related to decision time as in lexical decision experiments. The search model
adequately explains the procedure whereby the most common meaning of a homograph is
accessed but the less common meaning is accessed in some completely different manner.

Ball, Wood and Smith (1975) asked whether there are differences in the order in which the
visual, acoustic, articulatory and semantic codes related to words are accessed when one reads a
word. The study affirmed that a semantic superiority effect exists, in contrast to the
notion that some visual processing must precede semantic coding. They concluded that the size
of the detection units determines the order of processing, with larger units being detected first.
In perceptual search tasks visual processing would occur first while in semantic search tasks
the larger semantic targets are accessed first, accounting for the semantic superiority effect.

Dallas and Merikle (1976) investigated the facilitating effect of semantic context on word
recognition and concluded that when the word recognition process approximates the stages of
stimulus encoding, lexical memory retrieval and response execution, it does not seem reasonable to
attribute the semantic context effect to the stimulus encoding or lexical memory retrieval stages of
the recognition process; semantic context appears to affect the late response execution stage
of the word recognition process.

Karlin and Bower (1976) asked whether subjects could categorize a word semantically before
they precisely identified the word itself and failed to support the notion. The problem of
whether identification precedes classification is poorly formulated as of yet; the presence or absence
of the category effect results from the interaction between the type of stimulus and the definition
of the category decision for a particular task.

Rubin (1976) measured the probability of correctly guessing a missing word (every tenth word
deletion) using four context conditions: all words before the missing word; all words after the
missing word; all words before and one word after the missing word; and just one word after the
missing word. The probability of correctly guessing a missing word was greater given the
words before it than given the words after it. On a word-by-word level the advantage of
forward context did not hold: providing one additional word after a missing word provides
much greater constraint than providing ten additional words before it. Although the left-to-right
advantage holds over the long range, in the short range stimuli need not be processed in their
temporal order. The results are contrasted with previous views of positional redundancy drawn from
communication theory.

Cosky (1976) addressed the question of whether we read letter by letter. If word recognition
includes letter recognition, then the difficulty of a word corresponds to the difficulty of its
letters. Letter difficulty was assessed in two discrimination tasks and a letter naming task, and
easy and difficult letter words were exposed to adult subjects. Word frequency and length were also
manipulated. There was no effect for letter difficulty, suggesting that letter recognition does not
play a role in word recognition and we do not read letter by letter. Whatever plays a role in word
recognition is smaller than the word and correlated with word length in letters.

Travers (1975) reexamined previous data on the processing of words and letter strings and
concluded the data added support to the hypothesis that visual feature information from multiple
letter positions within a word is normally encoded in parallel rather than undergoing a process of
serial letter-by-letter coding. However, the data also suggested previous conclusions have to be
qualified.

Sloboda (1976) presented readers with verbal and musical text which contained spelling and
notational errors and found errors were least likely to be detected when they were in the
middle of words or musical phrases. Inference of interior elements usually results from structural
rather than visual functions.

Pearson and Kamil (1974) tested the time to process nouns and inflected and uninflected
verbs and demonstrated word length to be an important factor in latency. However, the addition
of inflections ranging from one letter to several added a constant to the response time. Further,
the effect of both length and affixes vanished with practice. The results suggest that while the
subject begins by using a strict letter-by-letter approach, he begins to realize there are other
strategies for recognizing words. Various hypotheses are tested, leading to progressively better
performance.

Farley and Yen (1976) analyzed the influence of affective properties of words on information
processing time in a high-speed visual search task analogous to rapid skilled scanning in reading.
Words extreme on emotionality yielded significantly longer reaction times than neutral words,
supporting previous research suggesting rated unpleasantness depresses learning.

Somekh (1976) had subjects write brief stories after exposure to a matrix in which neutral
and emotive words were embedded. Independent judges were able to distinguish to a significant
degree between the stories of subjects exposed to the emotive and neutral words under a brief
exposure subliminal condition but not to a longer exposure supraliminal condition. The findings
lend support for the notion that subliminal stimuli can be recovered in a variety of modes of
experience, in this case symbolically transferred as imagery in a story.

The recall of adjectives and nouns from two prose passages differing only in imagery value
of their adjectives was investigated by Hiscock (1976) to evaluate the effects of adjectival
imagery on prose recall. Adjectives from the high-imagery version of the narrative were recalled
more frequently than those from the low-imagery version. Adjective imagery had no effect on noun recall.
An individual difference analysis revealed a significant group by narrative version interaction, visualizers
outperforming verbalizers with high-imagery adjectives and verbalizers tending to outperform
visualizers when adjectives were low-imagery. The author suggests further research to define the
extent to which adjectival imagery value affects retention of different components of prose, the
mediational processes involved and possible application to education.

Richardson (1976) investigated the effects of stimulus attributes upon word recognition and found
that imageability, concreteness and number of syllables did not affect performance; words with
suffixes were no more difficult to pronounce than simple nouns; word frequency affected time
for discrimination but not that for reading; number of letters in a word affected time taken
to read it aloud but not the time taken to discriminate it from non-words; and non-words which
employed rules of English orthography and phonology were more difficult to discriminate from words
than those which violated such rules. It was concluded that reading aloud employs a grapheme-phoneme
translation based upon a letter-by-letter translation of the stimulus; discriminating words from
non-words obeying such rules is based on a wholistic analysis of the stimulus; and
discriminating words from non-words violating those rules employs a direct test of the regularity of
the stimulus based upon the combinatory rules of English orthography.

Botwinick and Storandt (1974) compared a group of older and younger adults with respect
to their qualitatively scored vocabulary responses after having been matched on the basis of
their vocabulary responses scored in a traditional way. The results cast doubt on the generalization
that vocabulary function is resistant to the ravages of time but psychomotor skills are not.
Quantitative scores were correlated with qualitative scores, and younger adults gave more frequent
superior synonym responses, suggesting a decline in qualitative aspects of vocabulary.

Wickelgren (1975) compared retention functions for children, young adults and elderly subjects
in continuous recognition memory for words over retention intervals ranging from two minutes to
two hours and found no significant differences in decay rate across the three groups, supporting
the hypothesis that memory storage dynamics are equivalent over different ages.

Hultsch and Craig (1976) investigated the hypothesis that older adults are more susceptible to
recall inhibition than younger adults and also manipulated the degree of organization of the
experimental materials. Organizational manipulation had no effect for either group, and older
adults were not as susceptible to response [inhibition] as younger adults.



Sentence Comprehension



Sentences form basic units of linguistic performance, and although a good deal of research has
focused on detailing the techniques, mnemonics and cues that the language user employs in
processing sentences in spoken and written form, the present state of the art suggests that
the development of a fully adequate model of sentence comprehension awaits further research and
analysis. Many questions arise relative to sentence comprehension. What are the conditions under which
sentences are best understood? Does the reader have to apprehend the full structural
description of a sentence for comprehension? What are the relative roles of syntax and semantics
in sentence comprehension? What is the difference between recall of a sentence and
understanding it? What aspects of deep structure influence sentence comprehension? Do sentences need
to be fully decomposed into deep structure before they can be fully understood? What
grammatical variables influence sentence processing? What order is followed in sentence
processing? In addition to these questions, much research relates to methodological issues in
measuring and analyzing sentence comprehension. The sections which follow present recent
findings from this burgeoning literature to illustrate the theoretical, empirical and methodological
activity taking place in sentence comprehension research which uses predominantly adult normal
subjects.

Various schemes have been suggested for the order of processing of linguistic information in
sentence understanding based on the surface-deep structure descriptive paradigm in
transformational generative theories of grammar, which suggests the order to be surface structure
to deep structure to semantic interpretation, the first step "breaking the code" of organization
and structure and the subsequent steps breaking the code [...]
(1974) suggest that there may be some construct validity for the alternative notion that a
semantic strategy has priority over a linguistic strategy in sentence perception and comprehension
by adults. Potential changes in semantic-syntactic strategy may be developmental in nature,
with the evidence revealing a curvilinear function in which earlier stages (below 18-24 months)
represent a rudimentary semantic strategy followed by a syntactic strategy (18-24 to 48 months) and
the early stages of an adult semantic strategy (above 48 months). A shift also occurs from
iconic to representational cognitive-linguistic modes in middle childhood. Results suggest a semantic
over a syntactic priority in adult sentence comprehension and point to this as a feasible area
[...].

In examining the multistore model of information processing (sensory processing, short
term memory, et al.), Mistler-Lachman (1974) examined a levels of processing model for sentence
comprehension which focuses on type of encoding rather than storage. Meaning recall and
exact word recall were used to assess sentence memory, and deeper comprehension was
found to lead to better recall. The author contends the outcome supports the use of memory
measures as estimates of comprehension. Implications for the multistore and the levels of processing
models are discussed.
Much research has been conducted on how sentential material is stored in long-term memory,
but such research has not fully separated recognition and recall from comprehension. Carpenter
(1974) presented a methodology that permits separate examination of sentence comprehension
processes and subsequent sentence memory processes. Subjects engaged in a verification process,
where they had to read a sentence and then verify it as being true or false. The verification
methodology was evaluated using comparative sentences.

Using the picture verification process of sentence comprehension, Carpenter and Just (1975)
analyzed the processes of sentence comprehension which appeared to remain invariant across
[...]. They argue that their model integrates sentence comprehension with the more
general [...]. Tanenhaus, Carroll and Bever (1976) critiqued the Carpenter
and Just model, suggesting it suffers from theoretical arbitrariness in the choice of representation
form, and hold that the common set of assumptions that emerge from the models examined is
inadequate as a base for either a theory of sentence comprehension or verification. In a later
paper Carpenter and Just examine sentence verification models in light of the criticisms
of Tanenhaus et al. and contrast three areas of disagreement between the
[...] approach and an information processing approach (nature of comprehension,
nature of sentence representation, subject strategies) and list errors in the Tanenhaus et al. paper.
Catlin and Jones (1976) reexamined the data presented by Carpenter and Just and point out
problems in the goodness of fit between the data and their theory. They suggest the fundamental
problem is the failure of the model to account for the difference in falsification time between
sentence-first and picture-first conditions, reflecting the model's simplistic view of the
inner-propositional comparison process, particularly as an account of sentence-picture comparison data for
affirmative versus negative sentences.

The semantic component in reading is important. The reader is thought to extract the semantic
content and transform it into its logical deep structure; in essence, we abstract out the semantic
core (to long term memory) and discard the surface representation (short term memory).
Kolers (1975) explored an alternative view and suggests that, in some instances (rapid reading
[...]), the surface representation is an integral part of the encoded sentence. Speed
of reading depends on pattern analysis operations directed at surface lexical representation. He
distinguishes between operational or procedural memory and substantive or semantic memory and
concludes that in many cases procedural memory may be sufficient to [...] the information encoded in
[...] assumptions related to concern for memory of context.

K-^rtee and Ramsburg (1974) underscore the increased attention being given to the role of
semantics in sentence processing and suggest this is due to modifications in the theory of
transformational grammar (generative semantics) and a revival of interest in attempting to define the
concept of anomaly. Using anomalous sentences, the time required to process a sentence over
three types of sentences (equational, transitive, and intransitive) into which one of three types of
anomalies had been introduced was the criterion. Assumptions underlying relative difficulty and the
perceptual clustering of phrases and meaningful units were supported by the findings. The authors
had to add cognitive assumptions in order to account for the findings.

Theories of sentence comprehension deal largely with a single literal level of meaning.
Harris (1976) reports a study which attempted to examine how comprehension of metaphors
takes place on the assumption that it operates in a two-stage process, first understanding the
words in a literal way and then using the literal meaning to construct the intended metaphorical
interpretation. Evidence for the two-stage process has been found for other types of non-literal
language. Using metaphorical interpretations from Shakespeare and non-metaphorical equivalents,
no differences between the two sentence types were found, using a paraphrasing task, in latency
and adequacy of the paraphrase. Thus, no evidence for the two-stage hypothesis was obtained
and metaphors do not require processing similar to that of other non-literal language; they
introduce no inordinate difficulties in comprehension in comparison to equivalent non-metaphorical
interpretations.

Just and Carpenter (1976) explored distinctions between the processes that take place in
sentence memory and sentence comprehension. The experimental task required subjects to read a
sentence and verify whether it was true or false at initial sentence presentation. Results indicated
that complex sentences are often recoded into a simpler form during the act of comprehension.
During recall the recoded representation may be retrieved. Thus, the "memory for gist"
effect may be due largely to a "comprehension of gist" process.

W o, e r (1974) conducted four experiments in which college students read sentences 9-13 words
in length after hearing four context conditions: word actually in sentence; cue to a word in a
sentence; 30-word, two-sentence context, either general or specific; no context. Time to process
sentences was the dependent variable. No significant differences attributable to the context
conditions were found; providing context did not facilitate processing of sentences, contrary to
conventional wisdom assumptions about the efficacy of context in deriving meaning.

Much research has been done in assessing the role of overall grammatical complexity and
different types of grammatical units in sentence recall and comprehension. Layton and Simpson
(1975) reported an experiment which investigated whether the match/mismatch between active
and passive sentences and questions would be observed when the comprehension task was made
more difficult by increasing the load on memory. When one sentence was presented, less error
occurred when sentences and voice were the same than when they differed. Both surface
and deep structure are retained after two questions; only the deep structure of sets of four and
eight sentences is retained. The view that surface information is retained in memory when
there is an appreciable memory load has to be questioned.

Cook (1976) examined the variety of explanations advanced for the difficulty of plurally-
imbedded relative clauses in English and advanced the hypothesis that errors in comprehension of
relative clauses are caused by perceptual strategies resorted to when the normal capacity of the
processing channel is exceeded. Results using native children, foreign adults and native adults
suggest this to be the case. An overload is reached with a single imbedding for children
and foreign adults and with a double imbedding for adult native speakers.

In exploring the grammatical dimension of reading, it has been found that reading performance
at all levels indicates application of the reader's implicit knowledge of the grammatical structure of
the language. Sawyer (1975-76) found the type of grammatical structure being processed affects
reader dependency upon cues in the visual display; more predictable phrases are recognized with
only minimal visual cues while less predictable phrases require a greater number of details.
Grammatical constraints operating within passive right-imbedded sentences offer firmer syntactic
cues than active left-imbedded sentences. Agentive phrases and locators share an equal likelihood
for occurrence as skilled readers pose hypotheses about what may be expected to follow a
passive verb.

Although several theoretical positions and a variety of empirical tasks suggest the primacy of
verbs in carrying the semantic core of meaning in a sentence, verbs are strikingly inferior in
memorial tasks and nouns are generally recognized and recalled better. The verb of the sentence
defines the plot in a play; the subject merely indicates one of the actors. However, when
subjects know they must remember something about the play (sentence), it is the actors (nouns)
that are remembered. This has become known as Fillenbaum's paradox. Reynolds and Flagg
(1976) attempted to resolve the paradox by several experiments which explored the efficiency of
various sentence elements in recognition memory. The total results suggest that verbs are
processed to a deeper semantic level while nouns have the form (orthographic, phonological)
recorded more strongly. The theater metaphor holds and is supported by a semantic
[...].
Sentences with self-imbedded relative clauses have received considerable attention in the past
decade and are regarded as being extremely difficult to comprehend. Much research has gone into
understanding why. Hakes, Evans and Brannon (1976) suggest that experiments purporting to
demonstrate this have confounded the self-imbedding or right-branching location of the relative
clause with their internal structure, comparing self-imbedded objective relatives with right-branching
subject relatives. Results show that when internal grammatical structure of the
relative clauses is controlled there is no evidence whatsoever that sentences containing self-imbedded
relatives are more difficult to comprehend than ones containing right-branching relatives. They
suggest that findings of past research can be accounted for by the fact that object relative
clauses are more difficult to comprehend than subject relative clauses rather than self-imbedded
relatives being more difficult than right-branching relatives.

The skilled reader interacts with graphemic, pattern, semantic-collocation and syntactic
considerations in reading. Wisher (1976) explored the idea that our
expectation of what is to come provides a catalyst that guides mapping from grapheme to
meaning, in developing a hypothesis which is tested against the incoming data. With this
information the reader assimilates what is expected with what is first seen. Young and older
readers do anticipate syntax, almost on a word by word basis. Wisher examined the influence of
knowing the syntactic structure of a sentence beforehand in reading. He confirmed that such
knowledge allows a sentence to be read with less effort and faster by reducing the need to
store in short term memory and engage in linguistic computation. Anticipation of structure is an
[...].

Kleiman (1975) examined the role of speech recoding (silent speech, inner speech,
subvocalization) in sentence comprehension in three experiments and concluded that the results
provide support for a model of reading sentences in which speech recoding occurs after lexical
access, thus facilitating the temporary storage of words necessary for sentence comprehension.

Newstead (1976) questioned the primacy of syntax in sentence processing and suggested that
semantic constraints are at least as important as syntactic factors in sentence perception.
Using reversible sentences (interchangeable subject, object) in three experiments it was shown that
semantic constraints exist and relate to sentence perception; the power of the constraints is
related to sentence complexity and the constraints derive from the overall relationship described
in the sentence. The results of previous work which failed to find semantic constraints could
be explained in terms of the strategies subjects adopted in the various tasks.

Holyoak (1974) re-examined previous research on semantic memory and the effect of imaging
sentences on verification tasks by having subjects rate the semantic relatedness of subject and
predicate words in sentences. It was concluded that the effects of rated imaging could not be
distinguished from those of the semantic variables. Greater comprehensibility was reported
for high-imagery sentences. Many questions remain on the role of imagery in sentence
verification, and research in this area can advance only if variables like semantic relatedness can be
controlled.

Pezdek and Royer (1974) explored the proposition that concrete sentences are stored as
visual images whereas abstract sentences are stored as verbatim strings. Poor detection by many
subjects may be due, however, to their comprehension of sentences rather than differences in
storage mechanisms for abstract and concrete sentences. Recognition for meaning changes in abstract
sentences was significantly higher for sentences embedded in a paragraph designed to increase
comprehension than for the group presented sentences without paragraphs. There was no
appreciable difference for groups in recognition of wording changes in abstract sentences or in
recognition for both wording and meaning changes in concrete sentences. Results are discussed
in terms of different storage mechanisms for abstract and concrete sentences.

Davies and Proctor (1976) had subjects observe both concrete and abstract sentences prior to
carrying out a distractor task which involved either verbal or perceptual coding. In agreement
with the hypothesis, sentence type interacted significantly with task. Abstract sentence recall
was significantly more impaired following the verbal task than the perceptual, while this effect
was reversed for concrete sentences. In another experiment this difference also reached
significance, and further error analysis confirmed that the perceptual task influenced the semantic
content of the sentences recalled. The studies lend support to the contention that the mediating
processes underlying apprehension and retention of concrete and abstract sentences are
qualitatively different. Mental imagery plays a role in internalization of concrete materials but
abstract sentences are mediated by an essentially verbal process.

Learning from Prose

In recent years there has been a steady increase in the amount of research reported on
learning from prose. As Frase (1972) points out, "The cumulative record of research on prose
learning over the past ten years is a positively accelerating function" (p. 337). Much of this has
been stimulated through the initial work of Ausubel on meaningful verbal learning and Rothkopf's
work on mathemagenic factors in text study.
With many college-adult reading programs being, in essence, learning to learn
programs, studies which attempt to point out the importance of the manner in which
material is organized for presentation and what the student brings to and does during
study can provide fruitful insights and research directions in the examination of
college-adult reading. Bower (1976), in analyzing story understanding and recall among college age
students, remarked, "I think the topic (processing and understanding text) too central to
cognitive psychology for us to continue postponing investigating it until we first figure out how
nonsense syllables are learned" (p. 533). The most recent NRC reviews (1974, 1975)
included scattered reports on prose learning. Because of the increasing potential of this area of
study, this review includes a good sampling of the more recent research produced as a result
of the activity of educational psychologists examining basic notions in prose learning and
[...].
A number of critiques and reviews related to prose learning have appeared. Frase (1968, 1970,
[...]) and Carroll (1971) review earlier studies; statistical techniques employed in
[...] studies were reacted to by Ladas (1973); Anderson (1970) and Anderson and Biddle
(1975) analyzed the prodigious literature on interspersed and inserted questions; and Rothkopf
[...] provides an overview of the mathemagenic concept. The recent review by Faw and
Waller (1976) organizes the literature into useful categories (advance organizers, response
modes, objectives, inserted questions) and provides an excellent overview and critique of prose
learning research through early 1975. Although there is inevitably some overlap, this review extends
the Faw and Waller critique with subsequently published materials.

Given the hypothesis that the greater the degree of interaction one has with material and
the greater the personal idiosyncratic organization one can place on materials to be learned, the
greater the recall, notetaking has always appeared to be a useful study skill. Howe (1974) reviewed
research which examined learner strategies in producing and reproducing information made
available previously and the facilitative effects of notetaking. Research continues which attempts to
provide support for the utility of notetaking and to define the optimal set of circumstances in
which this skill is beneficial to learners.

Fisher and Harris (1973) compared several groups on retention of materials presented in
lectures and noted no difference between groups on multiple-choice and free recall measures. In a later study
(1974) they noted that a subject's preference for form in taking notes may interact with the
experimental condition; subjects appear to do better when they are allowed to encode in the way
that they prefer. They also noted the lack of a significant review effect and superiority of females
over males.

Peters (1972) varied the presentation rate (normal 146 wpm, fast 202 wpm) of lectures and
found notetaking significantly reduced recall for both conditions on a multiple-choice test. Negative effects were
most evident for low efficiency readers. Aiken, Thomas and Shennum (1975), however, found
that students who took notes during a brief interval in which the lecture was stopped recalled
more than students who took notes while the lecture was in progress or took no notes at all.

[...] notetaking versus listening on connected and disconnected
[...] act of notetaking need not interfere with the
[...] was not more effective in immediate recall of
[...].
Annis and Davis (1975) analysed students assigned
[...] and a control group, with notetaking conditions manipulated to test notetaking
as an encoding process and notetaking as an external memory device for review.
Results [...] both encoding and external memory functions are important for successful
[...]; it is personally encoding the lecture which provides the best retention.
An external device is important for review, whether it is one's own notes or those
[...]. Mental review alone was singularly unsuccessful in retention. Fisher
[...] examined the encoding and external memory aspect of notetaking
[...]; groups varying in amount of review revealed superior recall for those who
[...], suggesting the external storage function of notetaking is a critical factor
in recall. Results of a study by Carter and Van Matre (1975), analyzing the relationship
of notetaking and review, support the desirability of taking and reviewing notes for maximum
[...] and recall efficiency. They suggest the external memory function is of the greatest
importance, encoding differences being minimal in importance.

Palmatier and Bennett (1974) examined the current perceptions of 223 university students on
taking and reading notes in study and found students reported regular notetaking and view
it as important in study. No differences were found between successful and highly
successful students (GPA); more successful students seemed to be better notetakers. No one
[...] notetaking, although such advice consistently appears in how-to-study
[...]. As pointed out by studies of notetaking over the years, notetaking is a skill that
[...] have maximum pay-off.

[...] assessed the effects of notetaking on retention of printed materials. Both notetaking
and [...] improved encoding and recall, but the effects were not additive. Encoding
[...] is involved in notetaking from print. Also working with print materials, Rickards and
August (1975) studied the effect of subject-generated and experimenter-provided underlining of
one sentence per paragraph and found subject-generated groups recalled significantly more
incidental material, supporting somewhat the encoding hypothesis advanced for notetaking.
Idstein and Jenkins (1972) used differing passages of text and found that underlining was not
more effective than repetitive reading for either passage. Friedman and Wilson (1975) glued
[...] texts sold to students and examined them after the course to determine text
usage and amount of underlining of major and minor points. Students made greater use of the
first part of the text, the earlier parts of the chapters, and the chapter summaries; proportion
of underlining showed a similar pattern, with females underlining more, and no difference in
usage was found in use of major or minor points or between students taking essay and
multiple-choice tests.

Faw and Waller (1976) observed that in notetaking studies time was usually controlled and control
[...] studies used aural rather than visual presentations, suggesting more work is
needed with print presentation modes and a need exists for studies assessing individual
differences. Treatment by aptitude research could reveal more promising interactions and
implications for practical application in this area.

[Illegible] of instructional objectives as orienting stimuli in prose learning continues.

Jenkins and Neisworth (1973) found that performance on test items was consistently
superior when the corresponding objectives were provided than when they were absent. Duchastel
and Brown (1974) reported that students provided with a list of objectives related to text performed
better than controls on objective relevant content, and poorer than controls on
[illegible] questions, suggesting that objectives enhance learning and may depress
incidental learning. Heston (1975) generated significant correlations between use of
[illegible] and basic reading skills, mastery of text material, and knowledge of objectives. The
[illegible] use of objectives scored higher than the group receiving objectives [illegible].

LaPorte and Nath (1976) found that a hard specific goal prior to learning increased
[illegible] performance goals and correspondingly increased the information gained during reading.
[Illegible] and Rothkopf (1975), Rothkopf and Kaplan (1972), Kaplan (1974),
[illegible] and Simmons (1974), Kaplan and Rothkopf (1974), Rothkopf and Billington (1975a), and
Frase and Kreitzberg (1975) provide substantial evidence that learning is enhanced by objectives.
[Illegible] facilitated at the expense of incidental learning, although the effect of
[illegible] learning is less clear. Specific, distributed objectives enhance learning and may also
serve a review function. Duchastel and Merrill (1973) caution that objectives may not always be
effective: if they are too numerous they overwhelm and confuse the learner, and in and by
themselves rarely reduce study time. Duell (1975) noted no significant effects for specificity
and knowledge of objectives on mc test performance and concluded that stating objectives may
only enhance learning of materials that are considered unimportant by subjects. Stumpf (1974)
found that prefatory objectives did not affect immediate achievement of community college
students. When number of goals and similarity between the wording of goal descriptions and
segments of text were systematically varied, Rothkopf and Billington (1975b) concluded that
too many goals decreased learning and text specific objectives enhanced learning.

Faw and Waller's (1976) review supports the facilitating effect of objectives on intentional
learning when objectives are more specific, less dense, and distributed with each section of the
text. They point out that efficiency per unit of study time needs to be carefully considered in
such research.

Advance Organizers, Orienting and Active Procedures

Advance organizers and orienting procedures in prose learning continue to interest researchers. 
Advance organizers consist of various kinds of introductory expositions presenting either new 
concepts under which detailed future learning can be subsumed or drawing distinctions that
enable the learner to discriminate new concepts. Orienting procedures consist of learning
instructions, activities, and information, beyond adjunct questions, which allow the individual to
become actively involved with the learning materials. The activities which subjects carry out in
memory processing are important.

Advance Organizers. Andrews (1975) compared posttest performance of subjects who heard
either a cognitive organizer, a list of directed reading questions, or both prior to reading a brief
passage. The cognitive organizer group demonstrated the best retention of the passage. Schnell
(1973) studied groups of college students who were provided with preorganizers, postorganizers,
both, and no organizers in learning from a passage on neural maturation. The three groups
receiving organizers exceeded the control group, with the postorganizer group performing best.
Rickards (1975-76) organized five conditions: superordinate precontent (advance organizer),
superordinate postcontent (post organizer), coordinate precontent, coordinate postcontent, and
control, to determine the processing effects of conceptually subordinate and coordinate organizers
interspersed in text. Superordinate precontent statements led to significantly more recall of
organizers and subordinate facts than any other type of inserted statement. Advance organizers
produced more contiguous recall of subordinate facts with organizers per organizer than did
post organizers. Only advance organizers generated conceptual recall incidental to the organizers
themselves.

Jenkins and Bausell (1976) pursued Ausubel's notion that the learner's present knowledge is
important in determining subsequent learning. Four experiments were reported. In the first, prose
learning was facilitated when meaning of key terms was built into cognitive structure. The second
experiment was run to determine if the results would generalize to school settings and produced
discrepant results. Experiments three and four further analyzed the effect of number of terms and
ratio of terms to prose material. The effect of number of terms was disconfirmed and the effect
of the ratio of terms to prose material confirmed. Relevant ideas anchored in cognitive structure
appear to facilitate learning only in materials with relatively few contextual anchors, and establishing
anchoring ideas in cognitive structure prior to learning does not necessarily transfer to fairly
liberal school-like conditions.

Faw and Waller (1976) enumerate many of the problems in advance organizer research,
related to lack of definition and the real effects of pre and post organizers, and conclude that
when materials are difficult or novel, or when subjects' ability is limited, organizers that are
general or abstract can facilitate retention of materials. Studies reported to date are limited
methodologically, and although advance organizers may prove to be beneficial in learning from
prose materials, the effect has not been conclusively established. For example, Schumacher,
Liebert, and Fass (1975) explored the interaction of the amount of higher level structural
information given to subjects (advance organizer) and passage organization. A passage typical of
textbook conditions was presented to 144 college students in either of two formats: one long
paragraph or six separate paragraphs. One group received an advance organizer in reading the
passage. ANOVA revealed that subjects performed better under the whole rather than the part
paragraph organization. Contrary to expectation, the interaction effect showed that the paragraph
organization main effect was due primarily to differences between the two paragraphs when an
advance organizer was not present. The passage with neither paragraph cues nor an advance
organizer produced better recall than the passage with paragraph cues but no advance
organizer. The advance organizer passage was intermediate regardless of paragraph structure.
The authors hypothesized that subjects, in the absence of advance organizers, carried out activities
aimed at generating their own structure for the passage. A subject-generated plan for a passage
may be superior to an experimenter-generated plan.

Orienting and Active Procedures. Arkes, Schumacher and Gardner (1976) studied the effects of
several orienting tasks on retention of prose materials. Three semantic tasks were employed,
[illegible] and sorting constituent sentences, and two nonsemantic tasks, circle e's and
[illegible]. [Illegible] and recall per unit time (efficiency) were dependent variables.
[Illegible] good recall by forcing interaction with the material. Non-
[illegible] task affecting the quality of the interaction with the
[illegible] interaction for intentional conditions. Maximum recall and
efficiency [illegible] simple instructions to read the passage, with longer time periods leading
[illegible].

Frase and Kreitzberg (1975) analyzed the effect of telling readers what sentences to learn by
[illegible] sentences or topics of sentences. Five types of learning directions were used:
[illegible], topical positive, topical negative, and control (learn the entire passage).
[Illegible] sentences, but not topical reference, facilitated relevant recall and depressed incidental
[illegible]. [Illegible] support the importance of precise references in the use of learning directions.
[Illegible], however, that active response invariably leads to increased learning. Howe
and Singer asked subjects to read a passage of prose under different conditions chosen to
represent realistic study strategies. In analysing three conditions (reading, copying, summarizing),
contrary to expectations, reading alone was superior to both copying and summarizing, with
summarizers learning more than copiers. Results held for both long term and immediate retention.
[Illegible] subsequent experiment reading the passage was contrasted with listening to the passage
[illegible] different combinations of presentation and subject activity. The findings of the first
experiment were replicated, but with auditory presentation at dictation speed, there was no
[illegible] difference between subjects who read and subjects who recorded. The authors
suggest that the interaction of presentation forms be considered in research on meaningful
learning. [Illegible] reported that the listening mode was superior in recall for all
[illegible] materials that required physical manipulation. In units requiring no physical
manipulation, [illegible] was more difficult for poor readers only.

[Illegible] sought to determine if large numbers of college students could modify their
[illegible] combination of study skills advice and behavioral self-control techniques. Treatments
were [illegible] through typed handouts. Self-monitoring (observing and recording own behavior)
was an effective addition to study skills advice, but stimulus control was not. Stimulus
control may be too complicated and unwieldy to teach in large groups, may not be amenable
to typed handout treatment, and may require too radical a change in student behavior. The
efficacy of the handout treatment for a large group of subjects is an interesting finding in this
study and confirms the results of Beneke and Harris (1972), who found handouts worked as well as
[illegible] therapy seminars in teaching study behaviors.

Ross and DiVesta (1976) examined the effects on retention of listening to versus actually
presenting oral summaries of prose materials. A 25% advantage in recall of relevant concepts and
[illegible] was found when an oral review was required compared to no review, and a 13% advantage
was found in recall when an oral report was made compared to listening to a review. Oral
review is an effective study strategy for enhancing recall of meaningful text material. Oral
[illegible] produced positive, significant, low correlations with anxiety symptoms, suggesting anxiety
serves an arousal function relative to learning but it need not be debilitating in terms of performance.

[Illegible] (1974) investigated the acquisition of course material under repeated testing and found:
with limited study intervals, acquisition increased over trials of study followed by testing; with
student determined study intervals, no effect was apparent for repeated testing. However, repeated
testing in the present study was important primarily as a way of increasing study time. If study
time can be increased by other means, repeated testing may have no advantage.



The influence of questions inserted in prose learning has been attributed to the control which they
exert over the general classes of mathemagenic responses taking place in inspection and
processing which the learner gives to prose learning. Research in this area continues examination
of such variables as question position, type of question, passage characteristics, source
of questions, immediate and delayed recall, incidental and intentional learning, reported study time,
and learner characteristics related to prose learning.

Meider [illegible] utilized multiple matrix sampling techniques to cover all question subdomains
(Bloom's Comprehension and Knowledge categories) and Bormuth's procedures to insure
production of test questions most relevant to prose articles, in testing the effect of inserted
questions and generated questions. The inserted post question group performed significantly better
on knowledge and comprehension than either the generated questions or control group on
the immediate test but not on the delayed retention test. Packwood (1976) inserted questions
before and after prose passages and found significant main effects for topicality but not for
position or spatial contiguity. A significant position by topicality interaction was noted, suggesting
prequestions positively affect retention while post questions do not. Hiller (1974) reported that
looking back does not necessarily facilitate learning, but this may have been due to the control
of study time. Boker (1974) demonstrated that inserted questions have the same effects when
assessed by delayed and immediate tests of retention.



Snowman and Cunningham (1975) examined the effect of pictorial and written questions
interspersed through text in four conditions. Reader generated pictures and experimenter provided
questions were equally facilitative, supporting the notion that pictures have an effect similar to
that of questions and suggesting that the range of study of adjunct aids in prose learning be
expanded to include visual and verbal prompts. Royer and Cable (1975) suggested that subjects
reading an initial concrete passage followed by exposure to an abstract passage would have a
new knowledge structure in contrast to subjects who first read an abstract, then a concrete
passage. The concrete first subjects recalled a minimum of 40% more material. In a subsequent
study (1976) they compared the effects of passages labelled abstract with illustration, abstract with
analogies, concrete, unembellished abstract, and control. Results provided substantial support for the
contention that the first three passages would have a greater facilitating effect on learning.

Rickards (1976a) found that the generally noted postquestion superiority effect did not hold for
all types of verbatim adjunct questions. Verbatim postquestions did not generate incidental
learning. High structural importance postquestions did not generate subsumptive activity but they
only required literal recall of the high level information they solicited. When deeper text processing
is required, significant incidental learning has been found to be related to question type.
Rickards (1976b) subsequently examined the effect of conceptual and verbatim questions placed
prior to and after text segments. Conceptual prequestions were superior to conceptual postquestions
[illegible] yielded less recall than verbatim postquestions. Only conceptual questions were superior on delayed
recall; they produce more highly structured and organized memories than verbatim postquestions.

Facilitative effects for higher order conceptual questions have been found in a number of
studies (Frase, 1973; Shavelson et al., 1974; Rickards and DiVesta, 1974; Felker and Dapra, 1975;
Mayer, 1975), although the effect can vary according to learner characteristics, question position,
type of learning, nature of the prose material, and frequency of questions.

Adejumo (1975) compared the results of self generated questions, questions generated by other
students, experimenter generated questions, and no questions, finding a significant omnibus F. Post
hoc comparisons revealed that the group using questions generated by other students performed
significantly better than the self generating question group for total and incidental learning and
for intentional items. The experimenter question group performed significantly better than all
others on intentional learning. GPA also differentiated subject performance. Morse (1976) reported
groups who used questions, either pre or post or experimenter or reader generated, did not
differ significantly on the posttest from the control group, but experimenter generated questions
tended to be more facilitative than reader generated questions.

Voss (1974) analyzed effects of two types of post questions (mc and completion) and found
that performance failed to improve over testing trials. Multiple choice questions, with repeated use,
actually produced interferences with higher level learning. Rickards et al. (1975-76) reported that,
with respect to intentional learning, inserted number and common word questions yielded equivalent
results regardless of text position. Only common word postquestions, however, produced significant
recall results. Neither common word nor number postquestions, however, produced significant
incidental learning results, with number questions producing a slightly negative mathemagenic
effect. Hiller (1974) examined the effect of easy and difficult distractors, finding that hard questions
require more processing.

Studies by Howe and Colley (1976a, b) suggest that prose learning will be influenced by
the set established by passage variables and the nature of inserted questions. Appreciably better
performance was reported when questions were similar to those asked in earlier passages but
not when earlier and later passages were similar.

Rothkopf and Billington (1974) found that performance on a test item is facilitated when there
is good veridicality between the test item and the text content, both when reinspection is
allowed and when it is forbidden. In addition, attempting to answer a question tends to improve subsequent
ability to answer other questions from the same topicospatial neighborhood of the text, but this
indirect priming effect was small.

King (1974) suggested that total learning time may be the most important variable in connected
discourse learning, and Faw and Waller (1976) found lack of control of the study time variable to
characterize much earlier research in this area. More recent research is beginning to examine this
factor. Bull (1973) and Bull and Disney (1973) point out that the primary value of questions may
be their arousal effect. However, Zilman and Cantor (1973) reported that questions did not arouse
curiosity but suggested the arousal function may be most important when attention is falling
rather than in first learning.

As pointed out by Faw and Waller (1976), the major issues in the mathemagenic effect of
questions relate to verbatim recall versus comprehension (Anderson, 1972), applicability to normal
classroom learning situations with extended textbook materials, the extent to which different
ability levels respond to inserted questions, and incidental as compared to intentional learning
effects. Only recently have individual difference variables been linked to question use (Shavelson
et al., 1974; Adejumo, 1975; Sanders, 1973). Hiller (1974) found learning from prose text to be
correlated with anxiety and self-confidence of subjects. Koran and Koran (1975) reported that
frequent questions may be best for low ability students through reducing the burden of semantic
[illegible] and strengthening associative learning. Their analysis of instructional treatment and
[illegible] factors revealed highly significant treatment effects on the ANOVA.
[Illegible] question pacing varies according to learner characteristics, with differences
[illegible] learning. ATI studies (Salomon, 1972) could provide
[illegible] some distance from developing decision rules which will
[illegible] the assignment of students to alternative treatments designed to reach the same
[illegible] objectives (see Cronbach and Snow, 19[illegible]).

[Illegible] broadest applications of mathemagenic questions in learning from text was reported
by Larkin and Reif (1976). Introductory college physics students were taught a skill for acquiring
[illegible] of a quantitative relation, a generally specified set of abilities which are
[illegible] applying relations in problem solving. To develop the skill, students studied
[illegible] interspersed questions requiring demonstration of the specified abilities. Such
[illegible] significantly enhanced acquisition of these abilities from a subsequently presented text
[illegible] other relations, demonstrating transfer effect. A person's strategies for learning from
[illegible] use of systematically designed study materials.

[Illegible] organization of prose can have substantial effects on what the student remembers from
reading (Rothkopf, 197[illegible]). Our knowledge about the processes involved in remembering from prose
[illegible] lags far behind that about memory for word lists. Several recent representative studies
which examined variables in text structure affecting the complex process of prose memory in
college-adult populations are presented here.
[Illegible] (1975) tested the effects of organization by referent, attribute, and random organization
using [illegible] questions and written recall. Half the college subjects also visualized the pages.
Referent, attribute, and random organization of text produced the best comprehension, in that order.
Under [illegible] organization, imagery instructed subjects stated significantly more text
[illegible]. Imagery also resulted in a significantly higher level of inference making and recall
[illegible] organization which was consistent with figural representations of text information.
[Illegible] to form and preserve unitized representations of text information.

Siegel, Lautman and Burkett (1974) conducted three experiments in which text readability
was lowered and tape recorded supplements were made to text to assess the effect of text
[illegible] and supplementation on text comprehensibility. No gains were evidenced through
lowered readability (Air Force technical training materials and look-up manuals) or auditory
supplementation. However, for home study, as compared to on-the-job instruction, lowered readability
resulted in increased criterion scores.



[Illegible] model, Johnson (1975) classified 200 college freshmen as Good
([illegible] average or above average comprehension), Difference Poor (average or above average
vocabulary but below average comprehension), and Deficit Poor (below average vocabulary and
comprehension) readers to examine the interaction between readability, subjective organization ability,
and passage organization. ANOVA on literal and inferential comprehension post scores revealed
effects supporting the hypothesis that readers use text organization processes in
[illegible] as well as reading for comprehension. The activity approximates an information processing
[illegible].

Comprehending prose is not based on literal recall of exact words, and research continues
[illegible] subunits which are most likely to be remembered in prose learning. Johnson (1974) found
[illegible] of recall of linguistic subunits of intact textbook prose was strongly related to
semantic dimensions of abstractness-concreteness, specificity of denotation, comprehensibility, and
[illegible] tests of immediate and delayed recall. Presumably, modification of texts on the four
[illegible].

[Illegible] the effects of isolation (underlining), readability ([illegible]),
and paragraph organization (intersentence structure) on learning from written instructional
[illegible] test as the criterion. Only paragraph organization had a significant effect on
[illegible]. [Illegible] interactions among the three variables were detected. Carver (1975) assessed the
[illegible] manipulating attention during prose reading using a reading input technique called
[illegible] cloze and programmed text [illegible]
in low motivation conditions but acted as a distractor in high motivation conditions.
Kintsch et al. (1975) presented short English texts, controlled for number of words and
propositions but differing in number of word concepts in the text base, to college students to assess the function
of several content variables influencing comprehension and recall of text material. Number of
[illegible] affected reading time and recall. Superordinate propositions were recalled better
than subordinate propositions and forgotten less when recall was delayed. Recall of word concepts
increased as a function of the number of repetitions of the concept in the text base and the
[illegible] of repetitions of the corresponding words in the actual text. The effects held for
[illegible].

Experienced readers report the ability to remember where certain information is located in
passages and books. Christie and Just (1976) used ingeniously constructed text passages to
examine how subjects remember the location of a sentence in a passage and how they make
use of that information in retrieving content information from text. They found location of
material could be remembered but did not guarantee memory for content. Also, one does not
necessarily have to know where the information is located in a text to retrieve it, although location
information may be helpful in some situations. Response latencies were similar for organized as
well as disorganized content. (Schumacher, Liebert and Fass (1975) reported that disorganized
presentation of prose may in some instances facilitate recall.) DiVesta and Gray (1973) also
demonstrated that scrambled passages were remembered significantly better than a completely
logical ordering of the passage. People internalize coherent passages and may also reorganize
incoherent passages for internal storage. Eye movement study was found to be a useful
technique in researching use of text location in recall.

Yekovich and Kulhavy (1976) reported structural and contextual effects in the organization of
prose, providing support for the notion of active processing in learning and the interdependency
of context and organizational forms. Immediate and delayed word recall and recognition revealed
organization form significantly affected retention and memory when subjects read passages
organized around either name or attribute sequences. Conceptually organized material facilitated
recall; subjects encountering coherently organized material are more likely to learn and remember
than when they encounter haphazardly constructed material.

Schallert (1976) investigated the role of context in determining how much a person remembers
from prose and exactly what is remembered, using ambiguous paragraphs whose semantic
representation can be influenced by differing contexts. Task instructions and time of exposure
to passages were manipulated to induce differential levels of processing and influence amount of
material retained. Recall and recognition tests revealed subjects remembered more information and
more context consonant information when given instructions which required processing the
paragraphs at a semantic level. Context was thus a powerful factor in determining which meaning
was remembered from polysemous paragraphs only when incoming information was processed at
a deeper, more semantic level.
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